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Prefaces 


Preface to the Second English Edition 


Science has not stood still in the years since the first English edition of this book 
was published. For example, Fermat’s last theorem has been proved, the Poincaré 
conjecture is now a theorem, and the Higgs boson has been discovered. Other events 
in science, while not directly related to the contents of a textbook in classical math- 
ematical analysis, have indirectly led the author to learn something new, to think 
over something familiar, or to extend his knowledge and understanding. All of this 
additional knowledge and understanding end up being useful even when one speaks 
about something apparently completely unrelated.! 

In addition to the original Russian edition, the book has been published in En- 
glish, German, and Chinese. Various attentive multilingual readers have detected 
many errors in the text. Luckily, these are local errors, mostly misprints. They have 
assuredly all been corrected in this new edition. 

But the main difference between the second and first English editions is the addi- 
tion of a series of appendices to each volume. There are six of them in the first and 
five of them in the second. So as not to disturb the original text, they are placed at the 
end of each volume. The subjects of the appendices are diverse. They are meant to be 
useful to students (in mathematics and physics) as well as to teachers, who may be 
motivated by different goals. Some of the appendices are surveys, both prospective 
and retrospective. The final survey contains the most important conceptual achieve- 
ments of the whole course, which establish connections between analysis and other 
parts of mathematics as a whole. 


'There is a story about Erdés, who, like Hadamard, lived a very long mathematical and human 
life. When he was quite old, a journalist who was interviewing him asked him about his age. Erdés 
replied, after deliberating a bit, “I remember that when I was very young, scientists established that 
the Earth was two billion years old. Now scientists assert that the Earth is four and a half billion 
years old. So, I am approximately two and a half billion years old.” 


vi Prefaces 


I was happy to learn that this book has proven to be useful, to some extent, not 
only to mathematicians, but also to physicists, and even to engineers from technical 
schools that promote a deeper study of mathematics. 

It is a real pleasure to see a new generation that thinks bigger, understands more 
deeply, and is able to do more than the generation on whose shoulders it grew. 


Moscow, Russia V. Zorich 
2015 


Preface to the First English Edition 


An entire generation of mathematicians has grown up during the time between the 
appearance of the first edition of this textbook and the publication of the fourth 
edition, a translation of which is before you. The book is familiar to many people, 
who either attended the lectures on which it is based or studied out of it, and who 
now teach others in universities all over the world. I am glad that it has become 
accessible to English-speaking readers. 

This textbook consists of two parts. It is aimed primarily at university students 
and teachers specializing in mathematics and natural sciences, and at all those who 
wish to see both the rigorous mathematical theory and examples of its effective use 
in the solution of real problems of natural science. 

Note that Archimedes, Newton, Leibniz, Euler, Gauss, Poincaré, who are held 
in particularly high esteem by us, mathematicians, were more than mere math- 
ematicians. They were scientists, natural philosophers. In mathematics resolving 
of important specific questions and development of an abstract general theory are 
processes as inseparable as inhaling and exhaling. Upsetting this balance leads to 
problems that sometimes become significant both in mathematical education and in 
science in general. 

The textbook exposes classical analysis as it is today, as an integral part of the 
unified Mathematics, in its interrelations with other modern mathematical courses 
such as algebra, differential geometry, differential equations, complex and func- 
tional analysis. 

Rigor of discussion is combined with the development of the habit of working 
with real problems from natural sciences. The course exhibits the power of con- 
cepts and methods of modern mathematics in exploring specific problems. Various 
examples and numerous carefully chosen problems, including applied ones, form 
a considerable part of the textbook. Most of the fundamental mathematical notions 
and results are introduced and discussed along with information, concerning their 
history, modern state and creators. In accordance with the orientation toward natural 
sciences, special attention is paid to informal exploration of the essence and roots of 
the basic concepts and theorems of calculus, and to the demonstration of numerous, 
sometimes fundamental, applications of the theory. 

For instance, the reader will encounter here the Galilean and Lorentz transforms, 
the formula for rocket motion and the work of nuclear reactor, Euler’s theorem 


Prefaces Vii 


on homogeneous functions and the dimensional analysis of physical quantities, the 
Legendre transform and Hamiltonian equations of classical mechanics, elements of 
hydrodynamics and the Carnot’s theorem from thermodynamics, Maxwell’s equa- 
tions, the Dirac delta-function, distributions and the fundamental solutions, convo- 
lution and mathematical models of linear devices, Fourier series and the formula 
for discrete coding of a continuous signal, the Fourier transform and the Heisenberg 
uncertainty principle, differential forms, de Rham cohomology and potential fields, 
the theory of extrema and the optimization of a specific technological process, nu- 
merical methods and processing the data of a biological experiment, the asymptotics 
of the important special functions, and many other subjects. 

Within each major topic the exposition is, as a rule, inductive, sometimes pro- 
ceeding from the statement of a problem and suggestive heuristic considerations 
concerning its solution, toward fundamental concepts and formalisms. Detailed at 
first, the exposition becomes more and more compressed as the course progresses. 
Beginning ab ovo the book leads to the most up-to-date state of the subject. 

Note also that, at the end of each of the volumes, one can find the list of the main 
theoretical topics together with the corresponding simple, but nonstandard problems 
(taken from the midterm exams), which are intended to enable the reader both de- 
termine his or her degree of mastery of the material and to apply it creatively in 
concrete situations. 

More complete information on the book and some recommendations for its use in 
teaching can be found below in the prefaces to the first and second Russian editions. 


Moscow, Russia V. Zorich 
2003 
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Preface to the Sixth Russian Edition 


On my own behalf and on behalf of future readers, I thank all those, living in dif- 
ferent countries, who had the possibility to inform the publisher or me personally 
about errors (typos, errors, omissions), found in Russian, English, German and Chi- 
nese editions of this textbook. 

As it turned out, the book has been also very useful to physicists; I am very 
happy about that. In any case, I really seek to accompany the formal theory with 
meaningful examples of its application both in mathematics and outside of it. 

The sixth edition contains a series of appendices that may be useful to students 
and lecturers. Firstly, some of the material is actually real lectures (for example, 
the transcription of two introductory survey lectures for students of first and third 
semesters), and, secondly, this is some mathematical information (sometimes of cur- 
rent interest, such as the relation between multidimensional geometry and the theory 
of probability), lying close to the main subject of the textbook. 


Moscow, Russia V. Zorich 
2011 


Preface to the Second Russian Edition 


In this second edition of the book, along with an attempt to remove the misprints 
that occurred in the first edition,” certain alterations in the exposition have been 
made (mainly in connection with the proofs of individual theorems), and some new 
problems have been added, of an informal nature as a rule. 

The preface to the first edition of this course of analysis (see below) contains a 
general description of the course. The basic principles and the aim of the exposition 
are also indicated there. Here I would like to make a few remarks of a practical 
nature connected with the use of this book in the classroom. 

Usually both the student and the teacher make use of a text, each for his own 
purposes. 

At the beginning, both of them want most of all a book that contains, along with 
the necessary theory, as wide a variety of substantial examples of its applications 
as possible, and, in addition, explanations, historical and scientific commentary, and 
descriptions of interconnections and perspectives for further development. But when 
preparing for an examination, the student mainly hopes to see the material that will 
be on the examination. The teacher likewise, when preparing a course, selects only 
the material that can and must be covered in the time alloted for the course. 

In this connection, it should be kept in mind that the text of the present book 
is noticeably more extensive than the lectures on which it is based. What caused 


2No need to worry: in place of the misprints that were corrected in the plates of the first edition 
(which were not preserved), one may be sure that a host of new misprints will appear, which so 
enliven, as Euler believed, the reading of a mathematical text. 
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this difference? First of all, the lectures have been supplemented by essentially an 
entire problem book, made up not so much of exercises as substantive problems of 
science or mathematics proper having a connection with the corresponding parts 
of the theory and in some cases significantly extending them. Second, the book 
naturally contains a much larger set of examples illustrating the theory in action than 
one can incorporate in lectures. Third and finally, a number of chapters, sections, or 
subsections were consciously written as a supplement to the traditional material. 
This is explained in the sections “On the introduction” and “On the supplementary 
material” in the preface to the first edition. 

I would also like to recall that in the preface to the first edition I tried to warn 
both the student and the beginning teacher against an excessively long study of 
the introductory formal chapters. Such a study would noticeably delay the analysis 
proper and cause a great shift in emphasis. 

To show what in fact can be retained of these formal introductory chapters in 
a realistic lecture course, and to explain in condensed form the syllabus for such 
a course as a whole while pointing out possible variants depending on the student 
audience, at the end of the book I give a list of problems from the midterm exam, 
along with some recent examination topics for the first two semesters, to which this 
first part of the book relates. From this list the professional will of course discern the 
order of exposition, the degree of development of the basic concepts and methods, 
and the occasional invocation of material from the second part of the textbook when 
the topic under consideration is already accessible for the audience in a more general 
form. 

In conclusion I would like to thank colleagues and students, both known and un- 
known to me, for reviews and constructive remarks on the first edition of the course. 
It was particularly interesting for me to read the reviews of A.N. Kolmogorov and 
V.I. Arnol’d. Very different in size, form, and style, these two have, on the profes- 
sional level, so many inspiring things in common. 


Moscow, Russia V. Zorich 
1997 


From the Preface to the First Russian Edition 


The creation of the foundations of the differential and integral calculus by Newton 
and Leibniz three centuries ago appears even by modern standards to be one of the 
greatest events in the history of science in general and mathematics in particular. 

Mathematical analysis (in the broad sense of the word) and algebra have inter- 
twined to form the root system on which the ramified tree of modern mathematics 
is supported and through which it makes its vital contact with the nonmathematical 
sphere. It is for this reason that the foundations of analysis are included as a neces- 
sary element of even modest descriptions of so-called higher mathematics; and it is 
probably for that reason that so many books aimed at different groups of readers are 
devoted to the exposition of the fundamentals of analysis. 
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This book has been aimed primarily at mathematicians desiring (as is proper) to 
obtain thorough proofs of the fundamental theorems, but who are at the same time 
interested in the life of these theorems outside of mathematics itself. 

The characteristics of the present course connected with these circumstances re- 
duce basically to the following: 


In the Exposition Within each major topic the exposition is as a rule inductive, 
sometimes proceeding from the statement of a problem and suggestive heuristic 
considerations toward its solution to fundamental concepts and formalisms. 

Detailed at first, the exposition becomes more and more compressed as the course 
progresses. 

An emphasis is placed on the efficient machinery of smooth analysis. In the ex- 
position of the theory I have tried (to the extent of my knowledge) to point out the 
most essential methods and facts and avoid the temptation of a minor strengthening 
of a theorem at the price of a major complication of its proof. 

The exposition is geometric throughout wherever this seemed worthwhile in or- 
der to reveal the essence of the matter. 

The main text is supplemented with a rather large collection of examples, and 
nearly every section ends with a set of problems that I hope will significantly com- 
plement even the theoretical part of the main text. Following the wonderful prece- 
dent of Pélya and Szegé, I have often tried to present a beautiful mathematical result 
or an important application as a series of problems accessible to the reader. 

The arrangement of the material was dictated not only by the architecture 
of mathematics in the sense of Bourbaki, but also by the position of analysis 
as a component of a unified mathematical or, one should rather say, natural- 
science/mathematical education. 


In Content This course is being published in two books (Part 1 and Part 2). 

The present Part 1 contains the differential and integral calculus of functions of 
one variable and the differential calculus of functions of several variables. 

In differential calculus we emphasize the role of the differential as a linear stan- 
dard for describing the local behavior of the variation of a variable. In addition to 
numerous examples of the use of differential calculus to study functional relations 
(monotonicity, extrema) we exhibit the role of the language of analysis in writing 
simple differential equations — mathematical models of real-world phenomena and 
the substantive problems connected with them. 

We study a number of such problems (for example, the motion of a body of vari- 
able mass, a nuclear reactor, atmospheric pressure, motion in a resisting medium) 
whose solution leads to important elementary functions. Full use is made of the lan- 
guage of complex variables; in particular, Euler’s formula is derived and the unity 
of the fundamental elementary functions is shown. 

The integral calculus has consciously been explained as far as possible using intu- 
itive material in the framework of the Riemann integral. For the majority of applica- 
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tions, this is completely adequate.* Various applications of the integral are pointed 
out, including those that lead to an improper integral (for example, the work in- 
volved in escaping from a gravitational field, and the escape velocity for the Earth’s 
gravitational field) or to elliptic functions (motion in a gravitational field in the pres- 
ence of constraints, pendulum motion). 

The differential calculus of functions of several variables is very geometric. In 
this topic, for example, one studies such important and useful consequences of the 
implicit function theorem as curvilinear coordinates and local reduction to canonical 
form for smooth mappings (the rank theorem) and functions (Morse’s lemma), and 
also the theory of extrema with constraint. 

Results from the theory of continuous functions and differential calculus are sum- 
marized and explained in a general invariant form in two chapters that link up nat- 
urally with the differential calculus of real-valued functions of several variables. 
These two chapters open the second part of the course. The second book, in which 
we also discuss the integral calculus of functions of several variables up to the gen- 
eral Newton—Leibniz—Stokes formula thus acquires a certain unity. 

We shall give more complete information on the second book in its preface. At 
this point we add only that, in addition to the material already mentioned, it contains 
information on series of functions (power series and Fourier series included), on in- 
tegrals depending on a parameter (including the fundamental solution, convolution, 
and the Fourier transform), and also on asymptotic expansions (which are usually 
absent or insufficiently presented in textbooks). 

We now discuss a few particular problems. 


On the Introduction I have not written an introductory survey of the subject, 
since the majority of beginning students already have a preliminary idea of differen- 
tial and integral calculus and their applications from high school, and I could hardly 
claim to write an even more introductory survey. Instead, in the first two chapters 
I bring the former high-school student’s understanding of sets, functions, the use 
of logical symbolism, and the theory of a real number to a certain mathematical 
completeness. 

This material belongs to the formal foundations of analysis and is aimed pri- 
marily at the mathematics major, who may at some time wish to trace the logical 
structure of the basic concepts and principles used in classical analysis. Mathemat- 
ical analysis proper begins in the third chapter, so that the reader who wishes to 
get effective machinery in his hands as quickly as possible and see its applications 
can in general begin a first reading with Chap. 3, turning to the earlier pages when- 
ever something seems nonobvious or raises a question which hopefully I also have 
thought of and answered in the early chapters. 


On the Division of Material The material of the two books is divided into chap- 
ters numbered continuously. The sections are numbered within each chapter sepa- 


3The “stronger” integrals, as is well known, require fussier set-theoretic considerations, outside the 
mainstream of the textbook, while adding hardly anything to the effective machinery of analysis, 
mastery of which should be the first priority. 
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rately; subsections of a section are numbered only within that section. Theorems, 
propositions, lemmas, definitions, and examples are written in italics for greater 
logical clarity, and numbered for convenience within each section. 


On the Supplementary Material Several chapters of the book are written as a 
natural extension of classical analysis. These are, on the one hand, Chaps. | and 2 
mentioned above, which are devoted to its formal mathematical foundations, and on 
the other hand, Chaps. 9, 10, and 15 of the second part, which give the modern view 
of the theory of continuity, differential and integral calculus, and finally Chap. 19, 
which is devoted to certain effective asymptotic methods of analysis. 

The question as to which part of the material of these chapters should be included 
in a lecture course depends on the audience and can be decided by the lecturer, but 
certain fundamental concepts introduced here are usually present in any exposition 
of the subject to mathematicians. 

In conclusion, I would like to thank those whose friendly and competent profes- 
sional aid has been valuable and useful to me during the work on this book. 

The proposed course was quite detailed, and in many of its aspects it was co- 
ordinated with subsequent modern university mathematics courses — such as, for 
example, differential equations, differential geometry, the theory of functions of 
a complex variable, and functional analysis. In this regard my contacts and dis- 
cussions with V.I. Arnol’d and the especially numerous ones with S.P. Novikov 
during our joint work with the so-called “experimental student group in natural- 
science/mathematical education” in the Department of Mathematics at MSU, were 
very useful to me. 

I received much advice from N.V. Efimov, chair of the Section of Mathemati- 
cal Analysis in the Department of Mechanics and Mathematics at Moscow State 
University. 

I am also grateful to colleagues in the department and the section for remarks on 
the mimeographed edition of my lectures. 

Student transcripts of my recent lectures which were made available to me were 
valuable during the work on this book, and I am grateful to their owners. 

I am deeply grateful to the official reviewers L.D. Kudryavtsev, V.P. Petrenko, 
and S.B. Stechkin for constructive comments, most of which were taken into ac- 
count in the book now offered to the reader. 


Moscow, Russia V. Zorich 
1980 
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Chapter 1 
Some General Mathematical Concepts 
and Notation 


1.1 Logical Symbolism 


1.1.1 Connectives and Brackets 


The language of this book, like the majority of mathematical texts, consists of ordi- 

nary language and a number of special symbols from the theories being discussed. 

Along with the special symbols, which will be introduced as needed, we use the 

common symbols of mathematical logic =, A, V, =, and } to denote respectively 

negation (not) and the logical connectives and, or, implies, and is equivalent to.' 
For example, take three statements of independent interest: 


L. If the notation is adapted to the discoveries..., the work of thought is mar- 
velously shortened. (G. Leibniz)* 

P. Mathematics is the art of calling different things by the same name. (H. Poinca- 
ré).3 


G. The great book of nature is written in the language of mathematics. (Galileo).* 


Then, according to the notation given above, Table 1.1 relates L, P, G. 


'The symbol & is often used in logic in place of A. Logicians more often write the implication 
symbol => as — and the relation of logical equivalence as <-—> or <>. However, we shall adhere 
to the symbolism indicated in the text so as not to overburden the symbol —, which has been 
traditionally used in mathematics to denote passage to the limit. 


2G.W. Leibniz (1646-1716) — outstanding German scholar, philosopher, and mathematician to 
whom belongs the honor, along with Newton, of having discovered the foundations of the in- 
finitesimal calculus. 


3H. Poincaré (1854-1912) — French mathematician whose brilliant mind transformed many areas 
of mathematics and achieved fundamental applications of it in mathematical physics. 
4Galileo Galilei (1564-1642) — Italian scholar and outstanding scientific experimenter. His works 


lie at the foundation of the subsequent physical concepts of space and time. He is the father of 
modern physical science. 
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Table 1.1 

Notation Meaning 

L>P L implies P 

LSP L is equivalent to P 

((L=> P)A(-P)) => (-L) If P follows from L and P is false, then L is false 
A(LSG)v(P$G)) G is not equivalent either to L or to P 


We see that it is not always reasonable to use only formal notation, avoiding 
colloquial language. 

We remark further that parentheses are used in the writing of complex statements 
composed of simpler ones, fulfilling the same syntactical function as in algebraic 
expressions. As in algebra, in order to avoid the overuse of parentheses one can 
make a convention about the order of operations. To that end, we shall agree on the 
following order of priorities for the symbols: 


Ty, AN, V;, >, >. 


With this convention the expression ~A A B V C => D should be interpreted 
as (((-A) A B) V C) = D, and the relation A Vv B > C as (AV B) > C, not as 
AV(B=>C). 

We shall often give a different verbal expression to the notation A > B, which 
means that A implies B, or, what is the same, that B follows from A, saying that 
B is anecessary criterion or necessary condition for A and A in turn is a sufficient 
condition or sufficient criterion for B, so that the relation A < B can be read in any 
of the following ways: 


A is necessary and sufficient for B; 

A holds when B holds, and only then; 
A if and only if B; 

A is equivalent to B. 


Thus the notation A < B means that A implies B and simultaneously B im- 
plies A. 

The use of the conjunction and in the expression A A B requires no explanation. 

It should be pointed out, however, that in the expression A V B the conjunction 
or is not exclusive, that is, the statement A Vv B is regarded as true if at least one 
of the statements A and B is true. For example, let x be a real number such that 
x? — 3x +2=0. Then we can write that the following relation holds: 


(x* — 3x +2=0) 6 =) V(x =2). 


1.1.2 Remarks on Proofs 


A typical mathematical proposition has the form A => B, where A is the assump- 
tion and B the conclusion. The proof of such a proposition consists of constructing 
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achain A> C, >---=>C, => B of implications, each element of which is either 
an axiom or a previously proved proposition.” 

In proofs we shall adhere to the classical rule of inference: if A is true and 
A= B, then B is also true. 

In proof by contradiction we shall also use the law of excluded middle, by virtue 
of which the statement A V —A (A or not-A) is considered true independently of the 
specific content of the statement A. Consequently we simultaneously accept that 
—=(7A) = A, that is, double negation is equivalent to the original statement. 


1.1.3: Some Special Notation 


For the reader’s convenience and to shorten the writing, we shall agree to denote the 
end of a proof by the symbol 
We also agree, whenever convenient, to introduce definitions using the special 
symbol := (equality by definition), in which the colon is placed on the side of the 
object being defined. 
For example, the notation 


b 
i f(x) dx = Gane P,é) 


defines the left-hand side in terms of the right-hand side, whose meaning is assumed 
to be known. 

Similarly, one can introduce abbreviations for expressions already defined. For 
example 


Yo fG)Ax = 0 (f: PE) 


i=l 


introduces the notation 0 (f; P, &) for the sum of special form on the left-hand side. 


1.1.4 Concluding Remarks 


We note that here we have spoken essentially about notation only, without analyzing 
the formalism of logical deductions and without touching on the profound questions 
of truth, provability, and deducibility, which form the subject matter of mathematical 
logic. 

How are we to construct mathematical analysis if we have no formalization of 
logic? There may be some consolation in the fact that we always know more than 


5The notation A = B => C will be used as an abbreviation for (A > B) A(B > C). 


4 1 Some General Mathematical Concepts and Notation 


we can formalize at any given time, or perhaps we should say we know how to do 
more than we can formalize. This last sentence may be clarified by the well-known 
proverb of the centipede who forgot how to walk when asked to explain exactly how 
it dealt with so many legs. 

The experience of all the sciences convinces us that what was considered clear 
or simple and unanalyzable yesterday may be subjected to reexamination or made 
more precise today. Such was the case (and will undoubtedly be the case again) 
with many concepts of mathematical analysis, the most important theorems and 
machinery of which were discovered in the seventeenth and eighteenth centuries, 
but which acquired its modern formalized form with a unique interpretation that is 
probably responsible for its being generally accessible, only after the creation of the 
theory of limits and the fully developed theory of real numbers needed for it in the 
nineteenth century. 

This is the level of the theory of real numbers from which we shall begin to 
construct the whole edifice of analysis in Chap. 2. 

As already noted in the preface, those who wish to make a rapid acquaintance 
with the basic concepts and effective machinery of differential and integral calculus 
proper may begin immediately with Chap. 3, turning to particular places in the first 
two chapters only as needed. 


11.5 Exercises 


We shall denote true assertions by the symbol | and false ones by 0. Then to each 
of the statements —A, AA B, AV B, and A=> B one can associate a so-called truth 
table, which indicates its truth or falsehood depending on the truth of the statements 
A and B. These tables are a formal definition of the logical operations —, A, V, >. 
Here they are: 


aA A fof1 AAB cae 

A 
=A |1\0 

0 jolo 

of 

AV B Bll, A=>B Bla, 
A A 

0 joi 0 fifi 

1 [afi 1 jo} 


1. Check whether all of these tables agree with your concept of the corresponding 
logical operation. (In particular, pay attention to the fact that if A is false, then the 
implication A => B is always true.) 
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2. Show that the following simple, but very useful relations, which are widely used 
in mathematical reasoning, are true: 


a) (AA B)S&-AV-B; 
b) -(AV B)S-AA-B; 
c) AS B)S(-B=>—A); 
d) (AS>B)S(CAV B); 
e) A(A> B)SAARAB. 


1.2 Sets and Elementary Operations on Them 


1.2.1 The Concept of a Set 


Since the late nineteenth and early twentieth centuries the most universal language 
of mathematics has been the language of set theory. This is even manifest in one of 
the definitions of mathematics as the science that studies different structures (rela- 
tions) on sets.°® 

“We take a set to be an assemblage of definite, perfectly distinguishable objects 
of our intuition or our thought into a coherent whole.” Thus did Georg Cantor,’ the 
creator of set theory, describe the concept of a set. 

Cantor’s description cannot, of course, be considered a definition, since it appeals 
to concepts that may be more complicated than the concept of a set itself (and in any 
case, have not been defined previously). The purpose of this description is to explain 
the concept by connecting it with other concepts. 

The basic assumptions of Cantorian (or, as it is generally called, “naive’’) set 
theory reduce to the following statements. 


1°. A set may consist of any distinguishable objects. 

2°. A set is unambiguously determined by the collection of objects that comprise 
it. 

3°. Any property defines the set of objects having that property. 

If x is an object, P is a property, and P(x) denotes the assertion that x has 
property P, then the class of objects having the property P is denoted {x | P(x)}. 
The objects that constitute a class or set are called the elements of the class or set. 

The set consisting of the elements x1,...,x, is usually denoted {x1,..., Xn}. 
Wherever no confusion can arise we allow ourselves to denote the one-element set 
{a} simply as a. 

The words “class”, “family”, “totality”, and “collection” are used as synonyms 
for “set” in naive set theory. 


6Bourbaki, N. “The architecture of mathematics” in: N. Bourbaki, Elements of the history of math- 
ematics, translated from the French by John Meldrum, Springer, New York, 1994. 


7G. Cantor (1845-1918) — German mathematician, the creator of the theory of infinite sets and the 
progenitor of set-theoretic language in mathematics. 
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The following examples illustrate the application of this terminology: 


— the set of letters “a” occurring in the word “T’; 

— the set of wives of Adam; 

— the collection of ten decimal digits; 

— the family of beans; 

— the set of grains of sand on the Earth; 

— the totality of points of a plane equidistant from two given points of the plane; 
— the family of sets; 

— the set of all sets. 


The variety in the possible degree of determinacy in the definition of a set leads 
one to think that a set is, after all, not such a simple and harmless concept. 
And in fact the concept of the set of all sets, for example, is simply contradictory. 


Proof Indeed, suppose that for a set M the notation P(M) means that M is not an 
element of itself. 

Consider the class K = {M | P(M)} of sets having property P. 

If K is a set either P(K) or —P(K) is true. However, this dichotomy does not 
apply to K. Indeed, P(K) is impossible; for it would then follow from the definition 
of K that K contains K as an element, that is, that —P(K) is true; on the other hand, 
—=P(K) is also impossible, since that means that K contains K as an element, which 
contradicts the definition of K as the class of sets that do not contain themselves as 
elements. 

Consequently K is not a set. 


This is the classical paradox of Russell,® one of the paradoxes to which the naive 
conception of a set leads. 

In modern mathematical logic the concept of a set has been subjected to detailed 
analysis (with good reason, as we see). However, we shall not go into that anal- 
ysis. We note only that in the current axiomatic set theories a set is defined as a 
mathematical object having a definite collection of properties. 

The description of these properties constitutes an axiom system. The core of 
axiomatic set theory is the postulation of rules by which new sets can be formed 
from given ones. In general any of the current axiom systems is such that, on the 
one hand, it eliminates the known contradictions of the naive theory, and on the 
other hand it provides freedom to operate with specific sets that arise in different 
areas of mathematics, most of all, in mathematical analysis understood in the broad 
sense of the word. 

Having confined ourselves for the time being to remarks on the concept of a 
set, we pass to the description of the set-theoretic relations and operations most 
commonly used in analysis. 

Those wishing a more detailed acquaintance with the concept of a set should 
study Sect. 1.4.2 in the present chapter or turn to the specialized literature. 


8B, Russell (1872-1970) — British logician, philosopher, sociologist and social activist. 
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Fig. 1.1 AEE 


1.2.2 The Inclusion Relation 


As has already been pointed out, the objects that comprise a set are usually called the 
elements of the set. We tend to denote sets by uppercase letters and their elements 
by the corresponding lowercase letters. 

The statement, “x is an element of the set X” is written briefly as 


xeExX (orX 5x), 


and its negation as 
x¢X (orX Xx). 


When statements about sets are written, frequent use is made of the logical oper- 
ators 5 (“there exists” or “there are’) and V (“every” or “for any”) which are called 
the existence and generalization quantifiers respectively. 

For example, the string Vx ((x € A) (x € B)) means that for any object x the 
relations x € A and x € B are equivalent. Since a set is completely determined by 
its elements, this statement is usually written briefly as 


A=B, 


read “A equals B’’, and means that the sets A and B are the same. 

Thus two sets are equal if they consist of the same elements. 

The negation of equality is usually written as A # B. 

If every element of A is an element of B, we write A C B or BD A and say 
that A is a subset of B or that B contains A or that B includes A. In this con- 
nection the relation A C B between sets A and B is called the inclusion relation 
(Fig. 1.1). 

Thus 


(A C B):=Yx ((x € A) > (x B)). 


If A Cc B and A ¥ B, we shall say that the inclusion A C B is strict or that A is 
a proper subset of B. 
Using these definitions, we can now conclude that 


(A=B)S(ACB)A(BCA). 


8 1 Some General Mathematical Concepts and Notation 


Fig. 1.2 rea: 


If M is aset, any property P distinguishes in M the subset 
{x €M| P(x)} 


consisting of the elements of M that have the property. 
For example, it is obvious that 


M={xeM|xeM}. 


On the other hand, if P is taken as a property that no element of the set M has, for 
example, P(x) := (x # x), we obtain the set 


S@={xeM|x Fx}, 


called the empty subset of M. 


1.2.3. Elementary Operations on Sets 


Let A and B be subsets of a set M. 
a. The union of A and B is the set 
AUB:={xeM|(xeA)V(xe B)}, 
consisting of precisely the elements of M that belong to at least one of the sets A 
and B (Fig. 1.2). 
b. The intersection of A and B is the set 


ANB:={xEM|@weA)AWeEB)}, 


formed by the elements of M that belong to both sets A and B (Fig. 1.3). 
c. The difference between A and B is the set 


A\B:={xeM|(weA)AWEB)I, 


consisting of the elements of A that do not belong to B (Fig. 1.4). 
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Fig. 1.3 


ANB 
M 
Fig. 1.4 A\B 
M 
Fig. 1.5 GA 
M 


The difference between the set M and one of its subsets A is usually called 
the complement of A in M and denoted Cy A, or CA when the set in which the 
complement of A is being taken is clear from the context (Fig. 1.5). 


Example As an illustration of the interaction of the concepts just introduced, let us 
verify the following relations (the so-called de Morgan” rules): 


Cu(AU B) =CyANCyB, (1.1) 
Cu(An B) =CyAUCyB. (1.2) 
Proof We shall prove the first of these equalities by way of example: 
(x €Cy(AU B)) > (x ¢ (AUB)) > (@ EAA EB) S 
=> (x € Cy A) A (x € CyB) = (x € (CuANCyB)). 
Thus we have established that 


Cu(AUB)C (CMAN CyB). (1.3) 


9A. de Morgan (1806-1871) — Scottish mathematician. 
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On the other hand, 
(x €(CyANCyB)) => ((x € Cu A) A (x € Cu B)) > 
=> (x ¢ A) A (x ¢ B)) > (x €(AUB)) > 
=> (x €Cy(AUB)), 
that is, 
(CuAN CyB) C Cy(AUB). (1.4) 
Equation (1.1) follows from (1.3) and (1.4). 


d. The direct (Cartesian) product of sets. For any two sets A and B one can form a 
new set, namely the pair {A, B} = {B, A}, which consists of the sets A and B and 
no others. This set has two elements if A ~ B and one element if A = B. 


This set is called the unordered pair of sets A and B, to be distinguished from the 
ordered pair (A, B) in which the elements are endowed with additional properties 
to distinguish the first and second elements of the pair {A, B}. The equality 


(A, B) = (C, D) 


between two ordered pairs means by definition that A = C and B = D. In particular, 
if A 4 B, then (A, B) 4 (B, A). 
Now let X and Y be arbitrary sets. The set 


XxY:={(@x,y)|@eX)AveY)}, 


formed by the ordered pairs (x, y) whose first element belongs to X and whose 
second element belongs to Y, is called the direct or Cartesian product of the sets X 
and Y (in that order!). 

It follows obviously from the definition of the direct product and the remarks 
made above about the ordered pair that in general X x Y AY x X. Equality holds 
only if X = Y. In this last case we abbreviate X x X as X?. 

The direct product is also called the Cartesian product in honor of Descartes, !° 
who arrived at the language of analytic geometry in terms of a system of coordi- 
nates independently of Fermat.!'! The familiar system of Cartesian coordinates in 
the plane makes this plane precisely into the direct product of two real axes. This 
familiar object shows vividly why the Cartesian product depends on the order of the 
factors. For example, different points of the plane correspond to the pairs (0, 1) and 
(1, 0). 


!0R. Descartes (1596-1650) — outstanding French philosopher, mathematician and physicist who 
made fundamental contributions to scientific thought and knowledge. 


‘Ip Fermat (1601-1665) — remarkable French mathematician, a lawyer by profession. He was 
one of the founders of a number of areas of modern mathematics: analysis, analytic geometry, 
probability theory, and number theory. 
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In the ordered pair z = (x1, x2), which is an element of the direct product Z = 
X xX X2 of the sets X; and X2, the element x, is called the first projection of the 
pair z and denoted pr, z, while the element x2 is the second projection of z and is 
denoted pry z. 

By analogy with the terminology of analytic geometry, the projections of an or- 
dered pair are often called the (first and second) coordinates of the pair. 


1.2.4 Exercises 


In Exercises 1, 2, and 3 the letters A, B, and C denote subsets of a set M. 
1. Verify the following relations. 


a) (ACC)A(BCC)S (AUB) CO); 
b) (CC A)A(CCB)S(CC(ANB)); 
c) Cu(CmMA)=A; 

d) (ACCyB)<s(BCCyA); 

e) (AC B)S(CyAD CyB). 


2. Prove the following statements. 


a) AU(BUC) =(AUB)UC =: AUBUC; 
b) AN(BNC)=(ANB)NC=:ANBNC; 
c) AN(BUC) =(ANB)U(ANC); 
d) AU(BNC)=(AUB)N(AUC). 


3. Verify the connection (duality) between the operations of union and intersection: 


a) Cu(AU B) = CyAN CyB; 
b) Cu(ANB)=CywAUCyA. 


4. Give geometric representations of the following Cartesian products. 


a) The product of two line segments (a rectangle). 

b) The product of two lines (a plane). 

c) The product of a line and a circle (an infinite cylindrical surface). 
d) The product of a line and a disk (an infinite solid cylinder). 

e) The product of two circles (a torus). 

f) The product of a circle and a disk (a solid torus). 


5. The set A = {(x1, x2) € xX? | x1 = x2} is called the diagonal of the Cartesian 
square X* of the set X. Give geometric representations of the diagonals of the sets 
obtained in parts a), b), and e) of Exercise 4. 

6. Show that 


a) (X x Y=9)S (X =98)Vv (Y =), andif X x Y 4 @, then 
b) (AX BCXXY)S(ACX)A(BCY), 
c) (Xx Y)U(Zx Y)=(XUZ)x Y, 
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d) (X x YYN(X’ x Y) =(XNX') x (YNY’). 
Here @ denotes the empty set, that is, the set having no elements. 


7. By comparing the relations of Exercise 3 with relations a) and b) from Exercise 2 
of Sect. 1.1, establish a correspondence between the logical operators =, A, V and 
the operations C, M, and U on sets. 


1.3 Functions 


1.3.1 The Concept of a Function (Mapping) 


We shall now describe the concept of a functional relation, which is fundamental 
both in mathematics and elsewhere. 

Let X and Y be certain sets. We say that there is a function defined on X with 
values in Y if, by virtue of some rule f, to each element x € X there corresponds 
an element y € Y. 

In this case the set X is called the domain of definition of the function. The 
symbol x used to denote a general element of the domain is called the argument 
of the function, or the independent variable. The element yo € Y corresponding to 
a particular value x9 € X of the argument x is called the value of the function at 
xo, or the value of the function at the value x = xo of its argument, and is denoted 
f (xo). As the argument x € X varies, the value y = f(x) € Y, in general, varies 
depending on the values of x. For that reason, the quantity y = f(x) is often called 
the dependent variable. 

The set 


f(X):= {y €¥ | ax (WE X)A(y= f@))} 


of values assumed by a function on elements of the set X will be called the set of 
values or the range of the function. 

The term “function” has a variety of useful synonyms in different areas of math- 
ematics, depending on the nature of the sets X and Y: mapping, transformation, 
morphism, operator, functional. The commonest is mapping, and we shall also use 
it frequently. 

For a function (mapping) the following notations are standard: 


f:X oY, X-SY. 


When it is clear from the context what the domain and range of a function are, 
one also uses the notation x > f(x) or y= f(x), but more frequently a function in 
general is simply denoted by the single symbol /. 

Two functions f; and f2 are considered identical or equal if they have the same 
domain X and at each element x € X the values f\(x) and f(x) are the same. In 
this case we write f| = fo. 
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If Ac X and f : X —> Y is a function, we denote by f|A or f|,4 the function 
g:A-— Y that agrees with f on A. More precisely, f|4(x) := p(x) if x € A. The 
function f|, is called the restriction of f to A, and the function f : X — Y is called 
an extension or a continuation of @ to X. 

We see that it is sometimes necessary to consider a function g : A > Y defined 
on a subset A of some set X while the range g(A) of g may also turn out be a subset 
of Y that is different from Y. In this connection, we sometimes use the term domain 
of departure of the function to denote any set X containing the domain of a function, 
and domain of arrival to denote any subset of Y containing its range. 

Thus, defining a function (mapping) involves specifying a triple (X, Y, f), where 


X is the set being mapped, or domain of the function; 

Y is the set into which the mapping goes, or a domain of arrival of the function; 
f is the rule according to which a definite element y € Y is assigned to each 
element x € X. 


The asymmetry between X and Y that appears here reflects the fact that the 
mapping goes from X to Y, and not the other direction. 
Now let us consider some examples of functions. 


Example I The formulas / = 27r and V = sar? establish functional relationships 
between the circumference / of a circle and its radius r and between the volume V 
of a ball and its radius r. Each of these formulas provides a particular function 
f :R+— Ry, defined on the set R+ of positive real numbers with values in the 
same set. 


Example 2 Let X be the set of inertial coordinate systems and c: X — R the func- 
tion that assigns to each coordinate system x € X the value c(x) of the speed of light 
in vacuo measured using those coordinates. The function c : X — R is constant, that 
is, for any x € X it has the same value c. (This is a fundamental experimental fact.) 


Example 3 The mapping G : R? — R? (the direct product R* =R x R=R, x Ry 
of the time axis R, and the spatial axis R,.) into itself defined by the formulas 


x’ =x— ut, 


is the classical Galilean transformation for transition from one inertial coordinate 
system (x,t) to another system (x’,t’) that is in motion relative to the first at 
speed v. 

The same purpose is served by the mapping L : R* —> R? defined by the relations 


; x—vt 
ye 
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te 


= Arm 


This is the well-known (one-dimensional) Lorentz? transformation, which plays 
a fundamental role in the special theory of relativity. The speed c is the speed of 
light. 


Example 4 The projection pr; : X; x X2 — X, defined by the correspondence 


X, x X22 (x1, +s ay € X, is obviously a function. The second projection 
pl: X1 x X2 — Xz is defined similarly. 


Example 5 Let P(M) be the set of subsets of the set M. To each set A € P(M) we 
assign the set Cy A € P(M), that is, the complement to A in M. We then obtain a 
mapping Cy :P(M) > P(M) of the set P(M) into itself. 


Example 6 Let E C M. The real-valued function x ¢ : M — R defined on the set M@ 
by the conditions (yzg(x) = Lifx € E) A (xe(x) = Oif x € Cy E) is called the 
characteristic function of the set E. 


Example 7 Let M(X; Y) be the set of mappings of the set X into the set Y and xo 
a fixed element of X. To any function f ¢ M(X; Y) we assign its value f(xo) € Y 
at the element xo. This relation defines a function F : M(X; Y) — Y. In particular, 
if Y =R, that is, Y is the set of real numbers, then to each function f : X — R the 
function F : M(X; IR) —> R assigns the number F'(f) = f (xo). Thus F is a function 
defined on functions. For convenience, such functions are called functionals. 


Example 8 Let I be the set of curves lying on a surface (for example, the surface 
of the earth) and joining two given points of the surface. To each curve y € I” one 
can assign its length. We then obtain a function F :  — R that often needs to be 
studied in order to find the shortest curve, or as it is called, the geodesic between the 
two given points on the surface. 


Example 9 Consider the set M(IR; R) of real-valued functions defined on the entire 
real line IR. After fixing a number a € R, we assign to each function f ¢ M(R; R) 
the function f, € M(R; R) connected with it by the relation f,(x) = f(x +a). The 
function f(x) is usually called the translate or shift of the function f by a. The 
mapping A: M(R; R) ~ M(R; R) that arises in this way is called the translation 
of shift operator. Thus the operator A is defined on functions and its values are also 
functions fg = A(f). 


'°17.A. Lorentz (1853-1928) — outstanding Dutch theoretical physicist. Poincaré called these trans- 
formations Lorentz transformations in honor of Lorentz, who stimulated the research of symme- 
tries in Maxwell’s equations. They were used by Einstein in 1905 in the formulation of his theory 
of special relativity. 


1.3 Functions 15 


This last example might seem artificial if not for the fact that we encounter real 


; hettens F oa 
operators at every turn. Thus, any radio receiver is an operator f +— f that trans- 
forms electromagnetic signals f into acoustic signals f; any of our sensory organs 
is an operator (transformer) with its own domain of definition and range of values. 


Example 10 The position of a particle in space is determined by an ordered triple 
of numbers (x, y, z) called its spatial coordinates. The set of all such ordered triples 
can be thought of as the direct product R x R x R = R? of three real lines R. 

A particle in motion is located at some point of the space IR? having coordinates 
(x(t), y(t), z(t)) at each instant t of time. Thus the motion of a particle can be 
interpreted as a mapping y : R > R®, where R is the time axis and R? is three- 
dimensional space. 

If a system consists of n particles, its configuration is defined by the position of 
each of the particles, that is, it is defined by an ordered set (x1, v1, 213 X2, 2, 223 ---3 
Xn> Yas Zn) Consisting of 3n numbers. The set of all such ordered sets is called the 
configuration space of the system of n particles. Consequently, the configuration 
space of a system of n particles can be interpreted as the direct product R* x R? x 
.-- x R? = R*” of n copies of R?. 

To the motion of a system of n particles there corresponds a mapping y : RR > 
IR” of the time axis into the configuration space of the system. 


Example 11 The potential energy U of a mechanical system is connected with the 
mutual positions of the particles of the system, that is, it is determined by the config- 
uration that the system has. Let Q be the set of possible configurations of a system. 
This is a certain subset of the configuration space of the system. To each position 
q € Q there corresponds a certain value U (q) of the potential energy of the system. 
Thus the potential energy is a function U : Q > R defined on a subset Q of the 
configuration space with values in the domain R of real numbers. 


Example 12 The kinetic energy K of a system of n material particles depends on 
their velocities. The total mechanical energy of the system EF, definedas E = K+U, 
that is, the sum of the kinetic and potential energies, thus depends on both the con- 
figuration g of the system and the set of velocities v of its particles. Like the con- 
figuration q of the particles in space, the set of velocities v, which consists of n 
three-dimensional vectors, can be defined as an ordered set of 3n numbers. The or- 
dered pairs (g, v) corresponding to the states of the system form a subset ® in the 
direct product R*” x R*” = R®, called the phase space of the system of n particles 
(to be distinguished from the configuration space R>”). 

The total mechanical energy of the system is therefore a function E: 6 > R 
defined on the subset ® of the phase space R® and assuming values in the domain 
R of real numbers. 

In particular, if the system is isolated, that is, no external forces are acting on it, 
then by the law of conservation of energy, at each point of the set ® of states of the 
system the function E will have the same value Eo € R. 
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Fig. 1.6 


1.3.2 Elementary Classification of Mappings 


When a function f : X — Y is called a mapping, the value f(x) € Y that it assumes 
at the element x € Y is usually called the image of x. 
The image of a set A C X under the mapping f : X — Y is defined as the set 


f(A):={yeY¥ | 4x (@e AA (y= f@))} 


consisting of the elements of Y that are images of elements of A. 
The set 


f-'(B):={xeX| f(x) eB} 


consisting of the elements of X whose images belong to B is called the pre-image 
(or complete pre-image) of the set B C Y (Fig. 1.6). 
A mapping f : X — Y is said to be 


surjective (a mapping of X onto Y)if f(X) =Y; 
injective (or an imbedding or injection) if for any elements x,, x2 of X 


(f G1) = f@2)) > G1 = 2), 


that is, distinct elements have distinct images; 
bijective (or a one-to-one correspondence) if it is both surjective and injective. 


If the mapping f : X — Y is bijective, that is, it is a one-to-one correspondence 
between the elements of the sets X and Y, there naturally arises a mapping 


7, By Ss 


defined as follows: if f(x) = y, then f-'(y) = x that is, to each element y € Y 
one assigns the element x € X whose image under the mapping f is y. By the 
surjectivity of f there exists such an element, and by the injectivity of f, itis unique. 
Hence the mapping f~! is well-defined. This mapping is called the inverse of the 
original mapping f. 
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Fig. 1.7 


It is clear from the construction of the inverse mapping that f~! : Y > X is itself 
bijective and that its inverse (f~!)~! : X > Y is the same as the original mapping 
f:x-Y. 

Thus the property of two mappings of being inverses is reciprocal: if f—! is 
inverse for f, then f is inverse for f~!. 

We remark that the symbol f~!(B) for the pre-image of a set B C Y involves 
the symbol f~! for the inverse function; but it should be kept in mind that the pre- 
image of a set is defined for any mapping f : X — Y, even if it is not bijective and 
hence has no inverse. 


1.3.3, Composition of Functions and Mutually Inverse Mappings 


The operation of composition of functions is on the one hand a rich source of new 
functions and on the other hand a way of resolving complex functions into simpler 
ones. 

If the mappings f : X — Y and g: Y — Z are such that one of them (in our 
case g) is defined on the range of the other (f), one can construct a new mapping 


gof:xX>Z, 
whose values on elements of the set X are defined by the formula 


(go f)(x) = 8(f(@)). 


The compound mapping g o f so constructed is called the composition of the 
mapping f and the mapping g (in that order!). 

Figure 1.7 illustrates the construction of the composition of the mappings f 
and g. 

You have already encountered the composition of mappings many times, both in 
geometry, when studying the composition of rigid motions of the plane or space, 
and in algebra in the study of “complicated” functions obtained by composing the 
simplest elementary functions. 

The operation of composition sometimes has to be carried out several times in 
succession, and in this connection it is useful to note that it is associative, that is, 


ho(gof)=(hog)of. 
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Proof Indeed, 


ho(go f\(x)=h((go f)(x)) =A(g(f@))) = 
=(hog)(f(x)) =((hog)o f)(x). 


This circumstance, as in the case of addition and multiplication of several num- 
bers, makes it possible to omit the parentheses that prescribe the order of the pair- 
ings. 

If all the terms of a composition f, o---o fj are equal to the same function f, 
we abbreviate it to f”. 

It is well known, for example, that the square root of a positive number a can be 
computed by successive approximations using the formula 


mee: aust 
Ant+1 = 5 \ %n a ; 


starting from any initial approximation xg > 0. This none other than the successive 
computation of f”(x0), where f(x) = 5 (x + oi Such a procedure, in which the 
value of the function computed at the each step becomes its argument at the next 
step, is called a recursive procedure. Recursive procedures are widely used in math- 
ematics. 

We further note that even when both compositions g o f and f o g are defined, 
in general 


gsofAfog. 


Indeed, let us take for example the two-element set {a, b} and the mappings f : 
{a,b} > a and g: {a,b} — b. Then it is obvious that g o f : {a,b} — b while 
fog:{a,b}—a. 

The mapping f : X — X that assigns to each element of X the element itself, 


that is x ey y, will be denoted ey and called the identity mapping on X. 
Lemma 
(go f =ex) => (g is surjective) A (f is injective). 
Proof Indeed, if f: X — Y,g:Y > X,and go f=ex: X —> X, then 
X =ex(X) = (go f)(X) =8(f(X%) Ca¥) 


and hence g is surjective. 
Further, if x; € X and x2 € X, then 


(x1 #2) => (ex 1) Fex(a2)) > ((8 0 fa) 4 (80 f)(x2)) > 
=> (g(f1))) 4 8(f@2)) > (fn) 4 f@2)), 


and therefore f is injective. 
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Using the operation of composition of mappings one can describe mutually in- 
verse mappings. 


Proposition The mappings f : X — Y and g:Y > X are bijective and mutually 
inverse to each other if and only if go f =ex and f og =ey. 


Proof By the lemma the simultaneous fulfillment of the conditions g o f = ex and 
f °g =ey guarantees the surjectivity and injectivity, that is, the bijectivity, of both 
mappings. 

These same conditions show that y = f(x) if and only if x = g(y). 


In the preceding discussion we started with an explicit construction of the inverse 
mapping. It follows from the proposition just proved that we could have given a less 
intuitive, yet more symmetric definition of mutually inverse mappings as those map- 
pings that satisfy the two conditions go f = ex and f og = ey. (In this connection, 
see Exercise 6 at the end of this section.) 


1.3.4 Functions as Relations. The Graph of a Function 


In conclusion we return once again to the concept of a function. We note that it has 
undergone a lengthy and rather complicated evolution. 

The term function first appeared in the years from 1673 to 1692 in works of 
G. Leibniz (in a somewhat narrower sense, to be sure). By the year 1698 the term had 
become established in a sense close to the modern one through the correspondence 
between Leibniz and Johann Bernoulli.!* (The letter of Bernoulli usually cited in 
this regard dates to that same year.) 

Many great mathematicians have participated in the formation of the modern 
concept of functional dependence. 

A description of a function that is nearly identical to the one given at the be- 
ginning of this section can be found as early as the work of Euler (mid-eighteenth 
century) who also introduced the notation f(x). By the early nineteenth century it 
had appeared in the textbooks of S. Lacroix.'* A vigorous advocate of this concept 
of a function was N.I. Lobachevskii,!° who noted that “a comprehensive view of 


'3Johann Bernoulli (1667-1748) — one of the early representatives of the distinguished Bernoulli 
family of Swiss scholars; he studied analysis, geometry and mechanics. He was one of the founders 
of the calculus of variations. He gave the first systematic exposition of the differential and integral 
calculus. 


146 F. Lacroix (1765-1843) — French mathematician and educator (professor at the Ecole Normale 
and the Ecole Polytechnique, and member of the Paris Academy of Sciences). 
'SN.L Lobachevskii (1792-1856) — great Russian scholar, to whom belongs the credit — shared with 


the great German scientist C.F. Gauss (1777-1855) and the outstanding Hungarian mathematician 
J. Bélyai (1802-1860) — for having discovered the non-Euclidean geometry that bears his name. 
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theory admits only dependence relationships in which the numbers connected with 
each other are understood as if they were given as a single unit.”!® It is this idea of 
precise definition of the concept of a function that we are about to explain. 

The description of the concept of a function given at the beginning of this section 
is quite dynamic and reflects the essence of the matter. However, by modern canons 
of rigor it cannot be called a definition, since it uses the concept of a correspondence, 
which is equivalent to the concept of a function. For the reader’s information we 
shall show here how the definition of a function can be given in the language of set 
theory. (It is interesting that the concept of a relation, to which we are now turning, 
preceded the concept of a function, even for Leibniz.) 


a. Relations 


Definition 1 A relation R is any set of ordered pairs (x, y). 

The set X of first elements of the ordered pairs that constitute 7 is called the 
domain of definition of R, and the set Y of second elements of these pairs the range 
of values of R. 

Thus, a relation can be interpreted as a subset R of the direct product X x Y. If 
X CX’ and Y CY’, then of course R Cc X x Y C X’ x Y’, so that a given relation 
can be defined as a subset of different sets. 

Any set containing the domain of definition of a relation is called a domain of 
departure for that relation. A set containing the region of values is called a domain 
of arrival of the relation. 

Instead of writing (x, y) € R, we often write x7 y and say that x is connected 
with y by the relation R. 

If R C X”, we say that the relation R is defined on X. 

Let us consider some examples. 


Example 13 The diagonal 
A={(a,b)€ X?|a=b} 


is a subset of X* defining the relation of equality between elements of X. Indeed, 
aAb means that (a, b) € A, that is, a =b. 


Example 14 Let X be the set of lines in a plane. 

Two lines a € X and b € X will be considered to be in the relation 7, and we 
shall write aRb, if b is parallel to a. It is clear that this condition distinguishes a set 
R of pairs (a,b) in X* such that aRb. It is known from geometry that the relation 
of parallelism between lines has the following properties: 


aRa (reflexivity); 


161 obachevskii, N.L. Complete Works, Vol. 5, Moscow—Leningrad: Gostekhizdat, 1951, p. 44 (Rus- 
sian). 
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aRb => bRa (symmetry); 
(aRb) A (bRc) => aRe (transitivity). 


A relation R having the three properties just listed, that is, reflexivity,!’ sym- 
metry, and transitivity, is usually called an equivalence relation. An equivalence 
relation is denoted by the special symbol ~, which in this case replaces the letter ?. 
Thus, in the case of an equivalence relation we shall write a ~ b instead of aRb and 
say that a is equivalent to b. 


Example 15 Let M be a set and X = P(M) the set of its subsets. For two arbitrary 
elements a and b of X = P(M), that is, for two subsets a and b of M, one of the 
following three possibilities always holds: a is contained in b; b is contained in a; 
a is not a subset of b and b is not a subset of a. 

As an example of a relation R on X7, consider the relation of inclusion for sub- 
sets of M, that is, make the definition 


aRb:=(acCcb). 


This relation obviously has the following properties: 


aRa (reflexivity); 
(aRb) A (bRc) => aRe (transitivity); 
(aRb) A (bRa) => aAb, that is, a = b (antisymmetry). 


A relation between pairs of elements of a set X having these three properties is 
usually called a partial ordering on X. For a partial ordering relation on X, we often 
write a < b and say that b follows a. 

If the condition 


Va Vb ((aRb) v (bRa)) 


holds in addition to the last two properties defining a partial ordering relation, that 
is, any two elements of X are comparable, the relation 7? is called an ordering, and 
the set X with the ordering defined on it is said to be linearly ordered. 

The origin of this term comes from the intuitive image of the real line R on which 
a relation a < b holds between any pair of real numbers. 


b. Functions and Their Graphs 
A relation R is said to be functional if 


(xRy1) A (xRy2) => (1 = y2). 


'TFor the sake of completeness it is useful to note that a relation R is reflexive if its domain of 
definition and its range of values are the same and the relation aRa holds for any element a in the 
domain of ?. 
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A functional relation is called a function. 

In particular, if X and Y are two sets, not necessarily distinct, a relation R C 
X x Y between elements x of X and y of Y is a functional relation on X if for every 
x € X there exists a unique element y € Y in the given relation to x, that is, such 
that xy holds. 

Such a functional relation ® C X x Y is a mapping from X into Y, or a function 
from X into Y. 

We shall usually denote functions by the letter f. If f is a function, we shall 


write y = f(x) or x gts y, as before, rather than x fy, calling y = f(x) the value 
of f at x or the image of x under f. 

AS we now see, assigning an element y € Y “corresponding” to x € X in accor- 
dance with the “rule” f, as was discussed in the original description of the concept 
of a function, amounts to exhibiting for each x € X the unique y € Y such that xfy, 
that is,(x,y)e ff CXXY. 

The graph of a function f : X — Y, as understood in the original description, is 
the subset I” of the direct product X x Y whose elements have the form (x, f(x)). 
Thus 


r:= {(x, y) eX x Yly=f@}. 


In the new description of the concept of a function, in which we define it as a 
subset f C X x Y, of course, there is no longer any difference between a function 
and its graph. 

We have exhibited the theoretical possibility of giving a formal set-theoretic defi- 
nition of a function, which reduces essentially to identifying a function and its graph. 
However, we do not intend to confine ourselves to that way of defining a function. 
At times it is convenient to define a functional relation analytically, at other times 
by giving a table of values, and at still other times by giving a verbal description of a 
process (algorithm) making it possible to find the element y € Y corresponding to a 
given x € X. With each method of presenting a function it is meaningful to ask how 
the function could have been defined using its graph. This problem can be stated as 
the problem of constructing the graph of the function. Defining numerical-valued 
functions by a good graphical representation is often useful because it makes the 
basic qualitative properties of the functional relation visualizable. One can also use 
graphs (nomograms) for computations; but, as a rule, only in cases where high pre- 
cision is not required. For precise computations we do use the table definition of a 
function, but more often we use an algorithmic definition that can be implemented 
on a computer. 


1.3.5 Exercises 


1. The composition R2 o R, of the relations 7; and 72 is defined as follows: 


Rz oR = {(a,2) | Ay @Riy A yRa2)}. 
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In particular, if R; C X x Y andR2 CY x Z, then R = R20R, CX x Z, and 


XRz i= Ay ((y EY) A(kR1y) A (yRo2)). 


a) Let Ay be the diagonal of X? and Ay the diagonal of Y*. Show that if the 
relations R; C X x Y and R2 C Y x X are such that (R20R, = Ay) A(R10R2= 
Ay), then both relations are functional and define mutually inverse mappings of X 
and Y. 

b) Let R Cc X*. Show that the condition of transitivity of the relation R is equiv- 
alent to the condition RoR CR. 

c) The relation R’ Cc Y x X is called the transpose of the relation R C X x Y if 
(yR'x) & (Ry). 

Show that a relation R C X? is antisymmetric if and only if RAR’ C Ay. 

d) Verify that any two elements of X are connected (in some order) by the rela- 
tion R C X°? if and only if R UR’ = X?. 


2. Let f : X — Y be a mapping. The pre-image f~!(y) C X of the element y € Y 
is called the fiber over y. 


a) Find the fibers for the following mappings: 
pry: X1 x X2> Xi, pro: X1 x X2—> X2. 


b) An element x; € X will be considered to be connected with an element x2 € 
X by the relation R C X2, and we shall write x) Rx if FS (x1) = f (x2), that is, x; 
and x> both lie in the same fiber. 

Verify that R is an equivalence relation. 

c) Show that the fibers of a mapping f : X — Y do not intersect one another 
and that the union of all the fibers is the whole set X. 

d) Verify that any equivalence relation between elements of a set makes it pos- 
sible to represent the set as a union of mutually disjoint equivalence classes of ele- 
ments. 


3. Let f : X — Y be a mapping from X into Y. Show that if A and B are subsets 
of X, then 

a) (ACB)S>(f(AVC f(B) F(AC B). 

b) (A# 9D) => (f(A) FB), 

c) f(ANB)C f(A)N f(B), 

d) f(AUB)= f(A)U f(B); 
if A’ and B’ are subsets of Y, then 

e) (ACB) => (f(A Cf 1B), 

f) f~'(A' NB’) = f(A f-(B)), 

g) f-\(AUB) = fT(ADU f-1(B); 
if Y D> A’D B’, then 

h) f-'(A\B) = fol ANF 1B), 
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i) f-\(Cy A) =Cx f-'(A); 
and for any A Cc X and B’ CY 
j) f-'F(A) DA, 
k) ff BY) CB’. 
4. Show that the mapping f : X > Y is 


a) surjective if and only if f(f~!(B’)) = B’ for every set B’ CY; 
b) bijective if and only if 


(f-'(F(A)) = A) AF (F1(B) = 8) 
for every set A C X and every set B’ CY. 
5. Verify that the following statements about a mapping f : X — Y are equivalent: 
a) f is injective; 
b) f-!(f(A)) =A for every AC X; 
c) f(AN B)= f(A) NM f(B) for any two subsets A and B of X; 


d) f(AANf(B=SSANB=B; 
e) f(A\B) = f(A)\f(B) whenever X D AD B. 


6. a) If the mappings f : X — Y and g: Y > X are such that g o f = ey, where 
ex is the identity mapping on X, then g is called a left inverse of f and f a right 
inverse of g. Show that, in contrast to the uniqueness of the inverse mapping, there 
may exist many one-sided inverse mappings. 

Consider, for example, the mappings f : X — Y and g: Y > X, where X isa 
one-element set and Y a two-element set, or the mappings of sequences given by 


fa 
(X1,--.,Xn,-+-) RH (4, X],...,Xn,---)5 
8g 
(V2, 60+ Vive) (V1, Y25 +2 Vay ee) 


b) Let f: X > Y and g: Y — Z be bijective mappings. Show that the mapping 
go f:X — Z is bijective and that (go f)"! = fo!og™!. 
c) Show that the equality 


(go f) '(C)= f-'(g71(C)) 


holds for any mappings f : X > Y and g: Y > Z andany setC C Z. 

d) Verify that the mapping F : X x Y > Y x X defined by the correspondence 
(x, y) > (y, x) is bijective. Describe the connection between the graphs of mutually 
inverse mappings f : X > Y and f-!:Y > X. 


7. a) Show that for any mapping f : X — Y the mapping F: X — X x Y defined 


by the correspondence x mabe (x, f (x)) is injective. 
b) Suppose a particle is moving at uniform speed on a circle Y; let X be the 


time axis and x mae y the correspondence between the time x € X and the position 


1.4 Supplementary Material 25 


y = f(x) € Y of the particle. Describe the graph of the function f : X — Y in 
XxyY. 


8. a) For each of the examples 1-12 considered in Sect. 1.3 determine whether the 
mapping defined in the example is surjective, injective, or bijective or whether it 
belongs to none of these classes. 

b) Ohm’s law J = V/R connects the current J in a conductor with the potential 
difference V at the ends of the conductor and the resistance R of the conductor. 
Give sets X and Y for which some mapping O : X — Y corresponds to Ohm’s law. 
What set is the relation corresponding to Ohm’s law a subset of? 

c) Find the mappings G~! and L~! inverse to the Galilean and Lorentz trans- 
formations. 


9. a) A set S C X is stable with respect to a mapping f : X > X if f(S) CS. 
Describe the sets that are stable with respect to a shift of the plane by a given vector 
lying in the plane. 

b) A set J C X is invariant with respect to a mapping f : X > X if fU) =T. 
Describe the sets that are invariant with respect to rotation of the plane about a fixed 
point. 

c) A point p € X is a fixed point of a mapping f : X — X if f(p) = p. Verify 
that any composition of a shift, a rotation, and a similarity transformation of the 
plane has a fixed point, provided the coefficient of the similarity transformation is 
less than 1. 

d) Regarding the Galilean and Lorentz transformations as mappings of the plane 
into itself for which the point with coordinates (x,t) maps to the point with coordi- 
nates (x’, t’), find the invariant sets of these transformations. 


10. Consider the steady flow of a fluid (that is, the velocity at each point of the flow 
does not change over time). In time ¢ a particle at point x of the flow will move 
to some new point f;(x) of space. The mapping x b> f;(x) that arises thereby on 
the points of space occupied by the flow depends on time and is called the mapping 
after time t. Show that f;, ° fi, = fr, ° ft = fry +n and fr o fLi =ex. 


1.4 Supplementary Material 


1.4.1 The Cardinality of a Set (Cardinal Numbers) 


The set X is said to be equipollent to the set Y if there exists a bijective mapping 
of X onto Y, that is, a point y € Y is assigned to each x € X, the elements of Y 
assigned to different elements of X are different, and every point of Y is assigned to 
some point of X. 

Speaking fancifully, each element x € X has a seat all to itself in Y, and there are 
no vacant seats ye Y. 
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It is clear that the relation XY thereby introduced is an equivalence relation. 
For that reason we shall write X ~ Y instead of X72 Y, in accordance with our earlier 
convention. 

The relation of equipollence partitions the collection of all sets into classes of 
mutually equivalent sets. The sets of an equivalence class have the same number of 
elements (they are equipollent), and sets from different equivalence classes do not. 

The class to which a set X belongs is called the cardinality of X, and also the 
cardinal or cardinal number of X. It is denoted card X. If X ~ Y, we write card 
X =cardY. 

The idea behind this construction is that it makes possible a comparison of 
the numbers of elements in sets without resorting to an intermediate count, that 
is, without measuring the number by comparing it with the natural numbers N = 
{1,2,3,...}. Doing the latter, as we shall soon see, is sometimes not even theoreti- 
cally possible. 

The cardinal number of a set X is said to be not larger than the cardinal number 
of a set Y, and we write card X < card Y, if X is equipollent to some subset of Y. 

Thus, 


(card X <cardY):=4Z CY (card X =card Z). 


If X CY, it is clear that card X < card Y. It turns out, however, that the relation 
X CY does not exclude the inequality card Y < card X, even when X is a proper 
subset of Y. 

For example, the correspondence x b> Ca is a bijective mapping of the interval 
—1 <x <1 of the real axis R onto the entire axis. 

The possibility of being equipollent to a proper subset of itself is a characteristic 
of infinite sets that Dedekind!* even suggested taking as the definition of an infinite 
set. Thus a set is called finite (in the sense of Dedekind) if it is not equipollent to 
any proper subset of itself; otherwise, it is called infinite. 

Just as the relation of inequality orders the real numbers on a line, the inequality 
just introduced orders the cardinal numbers of sets. To be specific, one can prove 
that the relation just constructed has the following properties: 


1° (card X < card Y) A (card Y < card Z) => (card X < card Z) (obvious). 

2° (card X < cardY) A (cardY < card X) = (card X = cardY) (the Schréder— 
Bernstein theorem!’). 

3° VX VY (card X < card Y) v (card Y < card X) (Cantor’s theorem). 


Thus the class of cardinal numbers is linearly ordered. 


'8R, Dedekind (1831-1916) — German algebraist who took an active part in the development of the 
theory of a real number. He was the first to propose the axiomatization of the set of natural numbers 
usually called the Peano axiom system after G. Peano (1858-1932), the Italian mathematician who 
formulated it somewhat later. 


!9R Bernstein (1878-1956) — German mathematician, a student of G. Cantor. E. Schréder (1841- 
1902) — German mathematician. 
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We say that the cardinality of X is Jess than the cardinality of Y and write 
card X < card Y, if card X < card Y but card X ~ card Y. Thus (card X < card Y) := 
(card X <card Y) A (card X AcardY). 

As before, let @ be the empty set and P(X) the set of all subsets of the set X. 
Cantor made the following discovery: 


Theorem card X < cardP(X). 


Proof The assertion is obvious for the empty set, so that from now on we shall 
assume X 4 ©. 

Since P(X) contains all one-element subsets of X, card X < card P(X). 

To prove the theorem it now suffices to show that card X ¢ card P(X) if X 4S. 

Suppose, contrary to the assertion, that there exists a bijective mapping f : X > 
P(X). Consider the set A = {x € X : x € f(x)} consisting of the elements x € X 
that do not belong to the set f(x) € P(X) assigned to them by the bijection. Since 
Ae P(X), there exists a € X such that f(a) = A. For the element a the relation 
a € A is impossible by the definition of A, and the relation a ¢ A is impossible, 
also by the definition of A. We have thus reached a contradiction with the law of 
excluded middle. 


This theorem shows in particular that if infinite sets exist, then even “infinities” 
are not all the same. 


1.4.2 Axioms for Set Theory 


The purpose of the present subsection is to give the interested reader a picture of an 
axiom system that describes the properties of the mathematical object called a set 
and to illustrate the simplest consequences of those axioms. 


1°. (Axiom of extensionality) Sets A and B are equal if and only if they have the 
same elements. 


This means that we ignore all properties of the object known as a “set” except 
the property of having elements. In practice it means that if we wish to establish that 
A = B, we must verify that Vx ((x € A) (x € B)). 


2°. (Axiom of separation) Jo any set A and any property P there corresponds a 
set B whose elements are those elements of A, and only those, having prop- 
erty P. 


More briefly, it is asserted that if A is a set, then B = {x € A | P(x)} is also a set. 

This axiom is used very frequently in mathematical constructions, when we se- 
lect from a set the subset consisting of the elements having some property. 

For example, it follows from the axiom of separation that there exists an empty 
subset Oy = {x € X | x Ax} in any set X. By virtue of the axiom of extensionality 
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we conclude that @ y = @y for all sets X and Y, that is, the empty set is unique. We 
denote this set by @. 

It also follows from the axiom of separation that if A and B are sets, then A\ B = 
{x € A|x ¢ B} is also a set. In particular, if M is a set and A a subset of M, then 
Cy A is also a set. 


3°. (Union axiom) For any set M whose elements are sets there exists a set ()M, 
called the union of M and consisting of those elements and only those that 
belong to some element of M. 


If we use the phrase “family of sets” instead of “‘a set whose elements are sets” 
the axiom of union assumes a more familiar sound: there exists a set consisting of 
the elements of the sets in the family. Thus, a union of sets is a set, and x € U Ms 
AX (Xe M)A(xEX)). 

When we take account of the axiom of separation, the union axiom makes it 
possible to define the intersection of the set M (or family of sets) as the set 


mM = {xem ivx ((XeM) > (xe x))]. 


4°. (Pairing axiom) For any sets X and Y there exists a set Z such that X and Y 
are its only elements. 


The set Z is denoted {X, Y} and is called the unordered pair of sets X and Y. 
The set Z consists of one element if X = Y. 

As we have already pointed out, the ordered pair (X,Y) differs from the un- 
ordered pair by the presence of some property possessed by one of the sets in the 
pair. For example, (X, Y) := {{X, X}, {X, Y}}. 

Thus, the unordered pair makes it possible to introduce the ordered pair, and the 
ordered pair makes it possible to introduce the direct product of sets by using the 
axiom of separation and the following important axiom. 


5°. (Power set axiom) For any set X there exists a set P(X) having each subset of 
X as an element, and having no other elements. 


In short, there exists a set consisting of all the subsets of a given set. 
We can now verify that the ordered pairs (x, y), where x € X and y € Y, really 
do form a set, namely 


Xx Y:={peP (P(X)UP(Y)) |(p=Q,y)) A@WEX)A(EY)}I. 


Axioms 1°—5° limit the possibility of forming new sets. Thus, by Cantor’s the- 
orem (which asserts that card X < cardP(X)) there is an element in the set P(X) 
that does not belong to X. Therefore the “set of all sets” does not exist. And it was 
precisely on this “set” that Russell’s paradox was based. 

In order to state the next axiom we introduce the concept of the successor XT 
of the set X. By definition Xt = X U {X}. More briefly, the one-element set {X} is 
adjoined to X. 
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Further, a set is called inductive if the empty set is one of its elements and the 
successor of each of its elements also belongs to it. 


6°. (Axiom of infinity) There exist inductive sets. 


When we take Axioms 1°—4° into account, the axiom of infinity makes it possible 
to construct a standard model of the set No of natural numbers (in the sense of 
von N eumann),”” by defining No as the intersection of all inductive sets, that is, the 
smallest inductive set. The elements of No are 


@, SGt=GU{B}={O}, {Sr =(}UL[{}}, ..., 


which are a model for what we denote by the symbols 0, 1, 2, ... and call the natural 
numbers. 


7°. (Axiom of replacement) Let F (x, y) be a statement (more precisely, a formula) 
such that for every xo in the set X there exists a unique object yo such that 
F (xo, yo) is true. Then the objects y for which there exists an element x € X 
such that F (x, y) is true form a set. 


We shall make no use of this axiom in our construction of analysis. 

Axioms 1°-7° constitute the axiom system known as the Zermelo—Fraenkel ax- 
: 21 
ioms. 

To this system another axiom is usually added, one that is independent of Ax- 
ioms 1°-7° and used very frequently in analysis. 


89. (Axiom of choice) For any family of nonempty and mutually nonintersecting 
sets there exists a set C such that for each set X in the family X 1 C consists of 
exactly one element. 


In other words, from each set of the family one can choose exactly one represen- 
tative in such a way that the representatives chosen form a set C. 

The axiom of choice, known as Zermelo’s axiom in mathematics, has been the 
subject of heated debates among specialists. 


1.4.3 Remarks on the Structure of Mathematical Propositions 
and Their Expression in the Language of Set Theory 


In the language of set theory there are two basic, or atomic types of mathematical 
statements: the assertion x € A, that an object x is an element of a set A, and the 


20J, von Neumann (1903-1957) — American mathematician who worked in functional analysis, the 


mathematical foundations of quantum mechanics, topological groups, game theory, and mathemat- 
ical logic. He was one of the leaders in the creation of the first computers. 

21B. Zermelo (1871-1953) — German mathematician. A. Fraenkel (1891-1965) — German (later, 
Israeli) mathematician. 


30 1 Some General Mathematical Concepts and Notation 


assertion A = B, that the sets A and B are identical. (However, when the axiom 
of extensionality is taken into account, the second statement is a combination of 
statements of the first type: (x € A) > (x € B).) 

A complex statement or logical formula can be constructed from atomic state- 
ments by means of logical operators — the connectors =, A, V = and the quantifiers 
V, 4 — by use of parentheses (). When this is done, the formation of any statement, 
no matter how complicated, reduces to carrying out the following elementary logical 
operations: 


a) forming a new statement by placing the negation sign before some statement 
and enclosing the result in parentheses; 

b) forming a new statement by substituting the necessary connectors A, V, and 
=> between two statements and enclosing the result in parentheses. 

c) forming the statement “for every object x property P holds”, (written as 
Vx P(x)) or the statement “there exists an object x having property P” (written 
as dx P(x)). 


For example, the cumbersome expression 


Ax (P(x) A (Vy (P(y) > (y =)))) 


means that there exists an object having property P and such that if y is any ob- 
ject having this property, then y = x. In brief: there exists a unique object x having 
property P. This statement is usually written 4!x P(x), and we shall use this abbre- 
viation. 

To simplify the writing of a statement, as already pointed out, one attempts to 
omit as many parentheses as possible while retaining the unambiguous interpreta- 
tion of the statement. To this end, in addition to the priority of the operators —, A, 
Vv, => mentioned earlier, we assume that the symbols in a formula are most strongly 
connected by the symbols €, =, then J, V, and then the connectors —, A, V, >. 

Taking account of this convention, we can now write 


lx P(x) := 4x (P@) AVy (PQ) > y=x)). 
We also make the following widely used abbreviations: 
(Wx € X) P := Vx (x eX => P(x)), 
(ax € X) P:=4x (xe XA P(Q)), 
(Vx >a) P:=Vx (x ERAx>a> P(x)), 
(ax >a) P :=Ax (x ERAx>aa FPQ)) 


Here R, as always, denotes the set of real numbers. 
Taking account of these abbreviations and the rules a), b), c) for constructing 
complex statements, we can, for example, give an unambiguous expression 


(lim f(@) =a) :=Ve > 0.35 > OVx ER (0 <|x—al <3 > | fa) — A] <e) 
ties 
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of the fact that the number A is the limit of the function f : R — R at the point 
aeéR. 

For us perhaps the most important result of what has been said in this subsection 
will be the rules for forming the negation of a statement containing quantifiers. 

The negation of the statement “for some x, P(x) is true” means that “for any x, 
P(x) is false”, while the negation of the statement “for any x, P(x) is true” means 
that “there exists an x such that P(x) is false’. 

Thus, 

adx P(x) & Vx -P(x), 


aAVx P(x) @ Ax A=P(x). 
We recall also (see the exercises in Sect. 1.1) that 


“(PA Q) > -PV-Q, 
“(PV O)e7PA-@, 
=(P> 0) PA-@. 
On the basis of what has just been said, one can conclude, for example, that 
a((Vx > a)P) & (Ax > a)-P. 
It would of course be wrong to express the right-hand side of this last relation as 
(ax <a)-P. 


Indeed, 


a((¥x > a)P) := 7(¥x (x ERAx >a > P(x))) > 


& Ax (A(x ERAx>a> P(x))) > 
> dx ((x ERAx>a)A =P (x)) =: (dx >a)-P. 
If we take into account the structure of an arbitrary statement mentioned above, 
we can now use the negations just constructed for the simplest statements to form 


the negation of any particular statement. 
For example, 


= (im f@)=A) &Je>0V5>03x ER 
xa 
(0 <|x—al <édA|f(x)—Al|>e). 
The practical importance of the rule for forming a negation is connected, in par- 


ticular, with the method of proof by contradiction, in which the truth of a statement 
P is deduced from the fact that the statement —P is false. 
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1.4.4 Exercises 


1. a) Prove the equipotence of the closed interval {x € R | 0 <x < 1} and the open 
interval {x € R | 0 < x < 1} of the real line R both using the Schréder—Bernstein 
theorem and by direct exhibition of a suitable bijection. 


b) Analyze the following proof of the Schréder—Bernstein theorem: 
(card X < card Y) A (card Y < card X) => (card X = card Y). 


Proof It suffices to prove that if the sets X, Y, and Z are such that X D Y D Z and 
card X = card Z, then card X = cardY. Let f : X — Z be a bijection. A bijection 
g:X — Y can be defined, for example, as follows: 


f(x), ifxe f"(X)\f"() for some n EN, 
8Q)= x otherwise. 
Here f” = f 0---o f is the nth iteration of the mapping f and N is the set of natural 
numbers. 


2. a) Starting from the definition of a pair, verify that the definition of the direct 
product X x Y of sets X and Y given in Sect. 1.4.2 is unambiguous, that is, the set 
P (P(X) UP(Y)) contains all ordered pairs (x, y) in which x € X andyeY. 

b) Show that the mappings f : X — Y from one given set X into another given 
set Y themselves form a set M(X, Y). 

c) Verify that if R is a set of ordered pairs (that is, a relation), then the first 
elements of the pairs belonging to 7 (like the second elements) form a set. 


3. a) Using the axioms of extensionality, pairing, separation, union, and infinity, 
verify that the following statements hold for the elements of the set No of natural 
numbers in the sense of von Neumann: 


spo x= 5": 

2° (Wx ENo) (at £2); 

3° xt aytoxay; 

4° (Wx ENo) (4 49 => (Ay ENo) (x = y*)). 


b) Using the fact that No is an inductive set, show that the following statements 
hold for any of its elements x and y (which in turn are themselves sets): 


1° cardx <cardx™; 
29 card @ <cardxt; 
3° cardx < card y = cardx+ < card yt; 
49 cardx <cardxt: 
5° cardx <card y > cardx* < card y; 
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6° x =y Scardx =card y; 
Te G@GEenvasy: 


c) Show that in any subset X of No there exists a (minimal) element x,, such 
that (Vx € X) (card x, < cardx). (If you have difficulty doing so, come back to this 
problem after reading Chap. 2.) 


4. We shall deal only with sets. Since a set consisting of different elements may 
itself be an element of another set, logicians usually denote all sets by uppercase 
letters. In the present exercise, it is very convenient to do so. 


a) Verify that the statement 
Vx dy Vz (ze y @ dw (Ze wAWEx)) 


expresses the axiom of union, according to which y is the union of the sets belonging 
to x. 
b) State which axioms of set theory are represented by the following statements: 


Vx Vy Vz (z<Eexozey)ex=y), 
Vx Vy az Vu (VEZ (V=xVV=y)), 
Vx dy Vz (cey Vu (Uez>uex)), 
dx (Vy (az (Ze y) > yEx)A 
AVw (wex = Vu (Vo (Ven (V=wVvew)) >uex))). 


c) Verify that the formula 


Vz (z€ f = (Axi ay (x1 Ex AY EYAZ=(1,y1))))A 
AWx (x1 €x => Ay Az (yl EYAZ=(H1, WI AZES))A 
Ax Wy Vy2 (Az1 az2 (21 € f A22€ f Az = (a1, v1) Az2 = (1, y2)) > 
=> VI = y2) 


imposes three successive restrictions on the set f: f is a subset of x x y; the pro- 
jection of f on x is equal to x; to each element x; of x there corresponds exactly 
one y; in y such that (x1, y1) € f. 

Thus what we have here is a definition of a mapping f:x — y. 

This example shows yet again that the formal expression of a statement is by no 
means always the shortest and most transparent in comparison with its expression 
in ordinary language. Taking this circumstance into account, we shall henceforth 
use logical symbolism only to the extent that it seems useful to us to achieve greater 
compactness or clarity of exposition. 


5. Let f : X — Y bea mapping. Write the logical negation of each of the following 
statements: 
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a) f is surjective; 

b) f is injective; 

c) f is bijective. 
6. Let X and Y be sets and f C X x Y. Write what it means to say that the set f is 
not a function. 


Chapter 2 
The Real Numbers 


Mathematical theories, as a rule, find uses because they make it possible to trans- 
form one set of numbers (the initial data) into another set of numbers constituting 
the intermediate or final purpose of the computations. For that reason numerical- 
valued functions occupy a special place in mathematics and its applications. These 
functions (more precisely, the so-called differentiable functions) constitute the main 
object of study of classical analysis. But, as you may already have sensed from your 
school experience, and as will soon be confirmed, any description of the properties 
of these functions that is at all complete from the point of view of modern mathe- 
matics is impossible without a precise definition of the set of real numbers, on which 
these functions operate. 

Numbers in mathematics are like time in physics: everyone knows what they are, 
and only experts find them hard to understand. This is one of the basic mathemati- 
cal abstractions, which seems destined to undergo significant further development. 
A very full separate course could be devoted to this subject. At present we intend 
only to unify what is basically already known to the reader about real numbers from 
high school, exhibiting as axioms the fundamental and independent properties of 
numbers. In doing this, our purpose is to give a precise definition of real numbers 
suitable for subsequent mathematical use, paying particular attention to their prop- 
erty of completeness or continuity, which contains the germ of the idea of passage 
to the limit — the basic nonarithmetical operation of analysis. 


2.1 The Axiom System and Some General Properties of the Set 
of Real Numbers 


2.1.1 Definition of the Set of Real Numbers 


Definition 1 A set R is called the set of real numbers and its elements are real 
numbers if the following list of conditions holds, called the axiom system of the 
real numbers. 
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(I) (AXIOMS FOR ADDITION) An operation 
+:RxR—-R, 


(the operation of addition) is defined, assigning to each ordered pair (x, y) of el- 
ements x, y of R a certain element x + y € R, called the sum of x and y. This 
operation satisfies the following conditions: 


1,. There exists a neutral, or identity element 0 (called zero) such that 
x+0=04+x=x 


for every x ER. 
24. For every element x € R there exists an element —x € R called the negative of 
x such that 


x +(-—x)=(-x)+x=0. 
3. The operation + is associative, that is, the relation 
a+(yty=aty)+z 


holds for any elements x, y,z of R. 
4,. The operation + is commutative, that is, 


X+ya=yrx 


for any elements x, y of R. 


If an operation is defined on a set G satisfying Axioms 1+, 2,, and 3, we say 
that a group structure is defined on G or that G is a group. If the operation is called 
addition, the group is called an additive group. If it is also known that the opera- 
tion is commutative, that is, condition 4, holds, the group is called commutative or 
Abelian. 

Thus, Axioms 1,—4, assert that R is an additive abelian group. 


(ID (AXIOMS FOR MULTIPLICATION) An operation 
e:RxR—-R, 


(the operation of multiplication) is defined, assigning to each ordered pair (x, y) of 
elements x, y of R a certain element x - y € R, called the product of x and y. This 
operation satisfies the following conditions: 


'N.H. Abel (1802-1829) — outstanding Norwegian mathematician, who proved that the general 
algebraic equation of degree higher than four cannot be solved by radicals. 
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1,. There exists a neutral, or identity element | € R\O (called one) such that 


for every x ER. 
2.. For every element x € R\0 there exists an element x~! € R, called the inverse 
or reciprocal of x, such that 


3.. The operation e is associative, that is, the relation 
X-(y:zZ)=(X-y)-zZ 


holds for any elements x, y,z of R. 
4,. The operation e is commutative, that is, 


X-y=y-x 
for any elements x, y of R. 


We remark that with respect to the operation of multiplication the set R\0, as one 
can verify, is a (multiplicative) group. 


(I, 11) (THE CONNECTION BETWEEN ADDITION AND MULTIPLICATION) Multi- 
plication is distributive with respect to addition, that is 


(x+y)z=xz+ yz 


forallx,y,zER. 


We remark that by the commutativity of multiplication, this equality continues to 
hold if the order of the factors is reversed on either side. 

If two operations satisfying these axioms are defined on a set G, then G is called 
a field. 


(IIT) (ORDER AXIOMS) Between elements of R there is a relation <, that is, for 
elements x, y € R one can determine whether x < y or not. Here the following 
conditions must hold: 


O<. Vx ER (& <x). 

le. @SyY)AQ Sx) > OH=Yy). 
2<. (x<y)AQSID>(* 2). 

3<. Vx ERVyER(Q@K<y)VO <x). 


The relation < on R is called inequality. 
A set on which there is a relation between pairs of elements satisfying Ax- 
ioms O<, l<, and 2<, as you know, is said to be partially ordered. Tf in addition 
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Axiom 3< holds, that is, any two elements are comparable, the set is linearly or- 
dered. Thus the set of real numbers is linearly ordered by the relation of inequality 
between elements. 


(I, I) (THE CONNECTION BETWEEN ADDITION AND ORDER ON R) Jf x, y, z are 
elements of R, then 


(x<y)>(44+z<y4+z). 


(II, 11) (THE CONNECTION BETWEEN MULTIPLICATION AND ORDER ON R) /f x 
and y are elements of R, then 


(O<x)AO<y>OK<x-y). 


(IV) (THE AXIOM OF COMPLETENESS (CONTINUITY)) Jf X and Y are nonempty 
subsets of R having the property that x < y for every x € X and every y € Y, then 
there exists c € R such that x <c<yforallx € X andy€Y. 


We now have a complete list of axioms such that any set on which these axioms 
hold can be considered a concrete realization or model of the real numbers. 

This definition does not formally require any preliminary knowledge about num- 
bers, and from it “by turning on mathematical thought” we should, again formally, 
obtain as theorems all the other properties of real numbers. On the subject of this 
axiomatic formalism we would like to make a few informal remarks. 

Imagine that you had not passed from the stage of adding apples, cubes, or other 
named quantities to the addition of abstract natural numbers; you had not studied the 
measurement of line segments and arrived at rational numbers; you did not know the 
great discovery of the ancients that the diagonal of a square is incommensurable with 
its side, so that its length cannot be a rational number, that is, that irrational numbers 
are needed; you did not have the concept of “greater” or “smaller” that arises in the 
process of measurement; you did not picture order to yourself using, for example, 
the real line. If all these preliminaries had not occurred, the axioms just listed would 
not be perceived as the outcome of intellectual progress; they would seem at the 
very least a strange, and in any case arbitrary, fruit of the imagination. 

In relation to any abstract system of axioms, at least two questions arise immedi- 
ately. 

First, are these axioms consistent? That is, does there exist a set satisfying all the 
conditions just listed? This is the problem of consistency of the axioms. 

Second, does the given system of axioms determine the mathematical object 
uniquely? That is, as the logicians would say, is the axiom system categorical? Here 
uniqueness must be understood as follows. If two people A and B construct models 
independently, say of number systems Ry, and Rg, satisfying the axioms, then a 
bijective correspondence can be established between the systems Ry, and Rg, say 
f : Ra — Rg, preserving the arithmetic operations and the order, that is, 


fatty) = fo)t+ fo), 
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f(x-y) = f@)- fO), 
x<yo f(x) < fQ). 


In this case, from the mathematical point of view, R4 and Rg are merely distinct 
but equally valid realizations (models) of the real numbers (for example, R4 might 
be the set of infinite decimal fractions and Rg the set of points on the real line). Such 
realizations are said to be isomorphic and the mapping f is called an isomorphism. 
The result of this mathematical activity is thus not about any particular realization, 
but about each model in the class of isomorphic models of the given axiom system. 

We shall not discuss the questions posed above, but instead confine ourselves to 
giving informative answers to them. 

A positive answer to the question of consistency of an axiom system is always 
of a hypothetical nature. In relation to numbers it has the following appearance: 
Starting from the axioms of set theory that we have accepted (see Sect. 1.4.2), one 
can construct the set of natural numbers, then the set of rational numbers, and finally 
the set R of real numbers satisfying all the properties listed. 

The question of the categoricity of the axiom system for the real numbers can 
be established. Those who wish to do so may obtain it independently by solving 
Exercises 23 and 24 at the end of this section. 


2.1.2 Some General Algebraic Properties of Real Numbers 


We shall show by examples how the known properties of numbers can be obtained 
from these axioms. 

a. Consequences of the Addition Axioms 

1°. There is only one zero in the set of real numbers. 


Proof If 0; and 02 are both zeros in R, then by definition of zero, 


0; = 0; +02 = 02 + 0; = 0p. 


2°. Each element of the set of real numbers has a unique negative. 


Proof If x; and x2 are both negatives of x € R, then 


XpHxytOHxyt+ (e442) = (41 +4) $42 =04+22 =. 


Here we have used successively the definition of zero, the definition of the neg- 
ative, the associativity of addition, again the definition of the negative, and finally, 
again the definition of zero. 
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3°. In the set of real numbers R the equation 
a+x=b 
has the unique solution 
x =b+(-a). 


Proof This follows from the existence and uniqueness of the negative of every ele- 
menta ER: 


(a+x=b) = ((xv +a) + (-a) =b+ (-a)) & 
& (x + (a + (-a)) =b + (-a)) @ (x +0=b+4 (-a)) & 


4 (x =b+ (—a)). 


The expression b + (—a) can also be written as b — a. This is the shorter and 
more common way of writing it, to which we shall adhere. 


b. Consequences of the Multiplication Axioms 


1°. There is only one multiplicative unit in the real numbers. 


2°. For each x #0 there is only one reciprocal x7'. 


3°. For a € R\O, the equation a - x =b has the unique solution x =b-a7'. 


The proofs of these propositions, of course, merely repeat the proofs of the corre- 
sponding propositions for addition (except for a change in the symbol and the name 
of the operation); they are therefore omitted. 


c. Consequences of the Axiom Connecting Addition and Multiplication 


Applying the additional axiom (I, II) connecting addition and multiplication, we 
obtain further consequences. 


1°. Foranyx €R 


x-0=0-x=0. 


Proof 


(x-0=x-(0+0)=x-0+x-0) > (x-0=x-0+ (—(x-0)) =0). 


From this result, incidentally, one can see that if x € R\0, then x~! € R\O. 


2°. (x-y=0) > (x =0)V (y=0). 
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Proof {f, for example, y 4 0, then by the uniqueness of the solution of the equation 
x-y=0 for x, we find x =0- y~! =0. 


3°. For anyx €R 
—x=(-1)-x. 


Proof x +(—1)-x =(1+(-1))-x =0-x =x-0=0, and the assertion now follows 
from the uniqueness of the negative of a number. 


4°. Foranyx ER 
(-lI(-x) =x. 


Proof This follows from 3° and the uniqueness of the negative of —x. 


5°. For anyx €R 


(—x)-(-—x)=x-x. 
Proof 


(=x)(=x) = ((-1) -x)(-x) = (« (ED) (Hx) = x ((- D(C) =o. 


Here we have made successive use of the preceding propositions and the commuta- 
tivity and associativity of multiplication. 


d. Consequences of the Order Axioms 


We begin by noting that the relation x < y (read “x is less than or equal to y”) can 
also be written as y > x (“y is greater than or equal to x”); when x ¥ y, the relation 
x < y is written x < y (read “x is less than y”) or y > x (read “‘y is greater than x”’), 
and is called strict inequality. 


1°. For any x and y inR precisely one of the following relations holds: 
x<y, x=y, x>y. 


Proof This follows from the definition of strict inequality just given and Axioms |< 
and 3<. 


2°. For anyx,y,z€R 


(x<y)AQS2>( <2), 


(x<yAQ<z)>( <z). 
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Proof We prove the first assertion as an example. By Axiom 2<, which asserts that 
the inequality relation is transitive, we have 


AEWA <7IEARSWNAYVSEDAVEAIDS> (@ S2). 
It remains to be verified that x ~ z. But if this were not the case, we would have 
ASWAYVKDSEESWNAYVK<DSCSWNAYVSEIDAVEY. 
By Axiom 1< this relation would imply 


(y=Z AQ #2), 


which is a contradiction. 


e. Consequences of the Axioms Connecting Order with Addition and 
Multiplication 


If in addition to the axioms of addition, multiplication, and order, we use Ax- 
ioms (I, III) and (II, III), which connect the order with the arithmetic operations, 
we can obtain, for example, the following propositions. 


1°. Foranyx,y,z,weER 


(~<y)> @+z2)<(yt+2), 
(O<x) => (—x <0), 
~s<WACSwW)>A+z2)<+w), 
(x<y)AZ<w>(a+z<ytuw). 


Proof We shall verify the first of these assertions. 
By definition of strict inequality and the axiom (I, III) we have 


@<y>@syS@+)S012). 
It remains to be verified that x + z 4 y + z. Indeed, 


(@+2=(4+2))> (#=04+9-z=y+@-2=)y), 


which contradicts the assumption x < y. 


2°. If x,y,z €R, then 


(O<x)AO<y)> O<xy), 
(x <0) A (y <0) S (0 < xy), 
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(x <0) A (0 < y) > (ry <0), 
(x <y)A (0 <z) => (xz < yz), 
(x <y)A(Z <0) > 0z < xz). 


Proof We shall verify the first of these assertions. By definition of strict inequality 
and the axiom (II, III) we have 


O<x)AO<y>O<xAOSy)> Oxy). 


Moreover, 0 ¥ xy since, as already shown, 


(x-y=0) > (x =0) Vv (y=0). 
Let us further verify, for example, the third assertion: 
(x <0 AO<y)>O<-x)AO<y)> 
=> (0 <(-x)-y) > (0<((-1)-x)y) > 
= (0 <(-1)- (xy) > (0 < -(@ry)) > (xy <0). 


The reader is now invited to prove the remaining relations independently and also 
to verify that if nonstrict inequality holds in one of the parentheses on the left-hand 
side, then the inequality on the right-hand side will also be nonstrict. 


39. 0<1. 


Proof We know that 1 € R\0, that is 0 4 1. If we assume | < 0, then by what was 
just proved, 


(1<0A1<0)>50<1-)D>S>(0<D. 


But we know that for any pair of numbers x, y € R exactly one of the possibilities 
x <y,x=y,x > y actually holds. Since 0 ¥ 1 and the assumption | < 0 implies 
the relation 0 < 1, which contradicts it, the only remaining possibility is the one in 
the statement of the proposition. 


1 


4°. 0<x)> (0<x7!) and O<x) A(x <y)SO<y ay! <x74). 


Proof Let us verify the first of these assertions. First of all, x~! 4 0. Assuming 
ee 0, we obtain 


(x7! <0) A (<x) => (x-x7' <0) 5 (1 <0). 


This contradiction completes the proof. 


We recall that numbers larger than zero are called positive and those less than 
Zero negative. 
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Thus we have shown, for example, that 1 is a positive number, that the product 
of a positive and a negative number is a negative number, and that the reciprocal of 
a positive number is also positive. 


2.1.3 The Completeness Axiom and the Existence of a Least Upper 
(or Greatest Lower) Bound of a Set of Numbers 


Definition 2 A set X C R is said to be bounded above (resp. bounded below) if 
there exists a number c € R such that x <c (resp. c < x) forallx € X. 


The number c in this case is called an upper bound (resp. lower bound) of the 
set X. It is also called a majorant (resp. minorant) of X. 


Definition 3 A set that is bounded both above and below is called bounded. 


Definition 4 An element a € X is called the largest or maximal (resp. smallest or 
minimal) element of X if x <a (resp. a <x) for all x € X. 


We now introduce some notation and at the same time give a formal expression 
to the definition of maximal and minimal elements: 


(a = max X) := (ae XAWxEX (x <a)), 
(a= min X) := (aeXAVxeX (a <x)). 


Along with the notation max X (read “the maximum of X”’) and min X (read “the 
minimum of X”’) we also use the respective expressions max;cy x and Miny<cy xX. 

It follows immediately from the order Axiom | < that if there is a maximal (resp. 
minimal) element in a set of numbers, it is the only one. 

However, not every set, not even every bounded set, has a maximal or minimal 
element. 

For example, the set X = {x € R| 0 < x < 1} has a minimal element. But, as one 
can easily verify, it has no maximal element. 


Definition 5 The smallest number that bounds a set X C R from above is called the 
least upper bound (or the exact upper bound) of X and denoted sup X (read “the 
supremum of X”’) or sup, cy x. 

This is the basic concept of the present subsection. Thus 


(s = sup X) :=Wx € X (x <5) A (Vs! <5 Ax’ € X (s' <x’))). 


The expression in the first set of parentheses on the right-hand side here says that 
s is an upper bound for X; the expression in the second set says that s is the smallest 
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number having this property. More precisely, the expression in the second set of 
parentheses asserts that any number smaller than s is not an upper bound of X. 

The concept of the greatest lower bound (or exact lower bound) of a set X is 
introduced similarly as the largest of the lower bounds of X. 


Definition 6 
(i = inf X) = Vx € X (<x) A (Wi' >i Ax’ EX (x’ <i’). 


Along with the notation inf X (read “the infimum of X”’) one also uses the nota- 
tion infy<x x for the greatest lower bound of X. 
Thus we have given the following definitions: 


sup X := min{c ER | Vx € X (x <o)}, 
inf X := max{ce R| Vx eX (c<x)}. 


But we said above that not every set has a minimal or maximal element. There- 
fore the definitions we have adopted for the least upper bound and greatest lower 
bound require an argument, provided by the following lemma. 


Lemma (The least upper bound principle) Every nonempty set of real numbers that 
is bounded from above has a unique least upper bound. 


Proof Since we already know that the minimal element of a set of numbers is 
unique, we need only verify that the least upper bound exists. 

Let X CR be a given set and Y = {y € R| Vx € X (x < y)}. By hypothesis, 
X # and Y 4 @. Then, by the completeness axiom there exists c € R such that 
Vx € X Vy €Y (x <c < y). The number c is therefore both a majorant of X anda 
minorant of Y. Being a majorant of X, c is an element of Y. But then, as a minorant 
of Y, it must be the minimal element of Y. Thus c= min Y = sup X. 


Naturally the existence and uniqueness of the greatest lower bound of a nonempty 
set of numbers that is bounded from below is analogous, that is, the following propo- 
sition holds. 


Lemma (X nonempty and bounded below) => (Alinf X). 


We shall not take time to give the proof. 

We now return to the set X = {x € R| 0 <x < 1}. By the lemma just proved it 
must have a least upper bound. By the very definition of the set X and the definition 
of the least upper bound, it is obvious that sup X < 1. 

To prove that sup X = | it is thus necessary to verify that for any number g < 1 
there exists x € X such that g < x; simply put, this means merely that there are 
numbers between g and 1. This of course, is also easy to prove independently (for 
example, by showing that g < 2~!(q + 1) < 1), but we shall not do so at this point, 
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since such questions will be discussed systematically and in detail in the next sec- 
tion. 

As for the greatest lower bound, it always coincides with the minimal element 
of a set, if such an element exists. Thus, from this consideration alone we have 
inf X = 0 in the present example. 

Other, more substantive examples of the use of the concepts introduced here will 
be encountered in the next section. 


2.2 The Most Important Classes of Real Numbers 
and Computational Aspects of Operations with Real 
Numbers 


2.2.1 The Natural Numbers and the Principle of Mathematical 
Induction 


a. Definition of the Set of Natural Numbers 


The numbers of the form 1, 1+ 1, (1+ 1)+ 1, and so forth are denoted respectively 
by 1, 2,3,... and so forth and are called natural numbers. 

Such a definition will be meaningful only to one who already has a complete 
picture of the natural numbers, including the notation for them, for example in the 
decimal system of computation. 

The continuation of such a process is by no means always unique, so that the 
ubiquitous “and so forth” actually requires a clarification provided by the funda- 
mental principle of mathematical induction. 


Definition 1 A set X C R is inductive if for each number x € X, it also contains 
x+1. 


For example, R is an inductive set; the set of positive numbers is also inductive. 

The intersection X =(),<4 Xq of any family of inductive sets Xj, if not empty, 
is an inductive set. 

Indeed, 


(xexX=(] Xx) > (Weed weXy)) > 


> (Wwe A (r+ Xs) > (+ DE] Xe=X). 


acA 


We now adopt the following definition. 


Definition 2 The set of natural numbers is the smallest inductive set containing 1, 
that is, the intersection of all inductive sets that contain 1. 
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The set of natural numbers is denoted N; its elements are called natural numbers. 

From the set-theoretic point of view it might be more rational to begin the natural 
numbers with 0, that is, to introduce the set of natural numbers as the smallest 
inductive set containing 0; however, it is more convenient for us to begin numbering 
with 1. 

The following fundamental and widely used principle is a direct corollary of the 
definition of the set of natural numbers. 


b. The Principle of Mathematical Induction 


If a subset E of the set of natural numbers N is such that 1 € E and together with 
each number x € E, the number x + 1 also belongs to E, then E=N. 
Thus, 


(ECN)A(LEL)A(xEESX+DE£L)SEEN. 


Let us illustrate this principle in action by using it to prove several useful prop- 
erties of the natural numbers that we will be using constantly from now on. 


1°. The sum and product of natural numbers are natural numbers. 


Proof Let m,n € N; we shall show that (m+n) € N. We denote by E the set of 
natural numbers n for which (m+n) € N for all m € N. Then 1 € E since (m € 
N) > (m+ 1) EN) for any m EN. Ifne E, thatis, (m+n) EN, then(n+1)€ EF 
also, since (m+ (n+ 1)) = ((m+n)+ 1) EN. By the principle of induction, E = N, 
and we have proved that addition does not lead outside of N. 

Similarly, taking E to be the set of natural numbers n for which (m - n) € N for 
all m EN, we find that 1 € E, since m- 1 =m, and ifn € E, that is, m-n €N, then 
m-(n+1)=mn 4m is the sum of two natural numbers, which belongs to N by 
what was just proved above. Thus (1 € E) > ((n + 1) € E), and so by the principle 
of induction E =N. 


2°. MeN AnF¢1) > (n—-1L EN). 


Proof Consider the set FE consisting of all real numbers of the form n — 1, where 
n is a natural number different from 1; we shall show that E =N. Since 1 EN, it 
follows that 2:= (1+ 1)¢Nandhence 1=(2-—1)€E. 

If m € E, then m =n — 1, where n € N; then m+ 1=(n+ 1) — 1, and since 
n+1e€N, we have (m+ 1) € E. By the principle of induction we conclude that 
E=N. 


3°. For any n €N the set {x € N|n < x} contains a minimal element, namely 


min{x eN|na<x}s=n+1. 
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Proof We shall show that the set E of n € N for which the assertion holds coincides 
with N. 
We first verify that 1 € E, that is, 


min{x €N|1<x}=2. 
We shall also verify this assertion by the principle of induction. Let 
M={xeN|@=1)V2<x)}. 


By definition of M we have | € M. Then if x € M, either x = 1, in which case 
x+1=2€M, or else 2 < x, and then 2 < (x + 1), and once again (x + 1) € M. 
Thus M =N, and hence if (x £1) A (x EN), then 2 < x, that is, indeed min{x € 
N|1<x}=2.Hence le E. 

We now show that ifn € FE, then (n+ 1) € E. 

We begin by remarking that if x € {x eN|n+1 <x}, then 


(x-l=yefyeN|n< y}. 


For, by what has already been proved, every natural number is at least as large as 1; 
therefore (n+1<x)=> (1 <x) => (x ¥$ 1), and then by the assertion in 2° we have 
(x-l=yeN. 

Now let n € E, that is, minfye N|n < y}=n+1.Thenx —1>y>n-+1 and 
x >n-+2. Hence, 


(x €{x EN|n+1<x})>(@>n+2) 


and consequently, minfx € N[|n+1<x}=n+2, thatis,(n+1)€E. 
By the principle of induction E = N, and 3° is now proved. 


As immediate corollaries of 2° and 3° above, we obtain the following properties 
(49, 59. and 6°) of the natural numbers. 


4°. (mEeN)AEN A(n<m) > (n+1<m). 


5°. The number (n + 1) € N is the immediate successor of the number n € N; that 
is, if n EN, there are no natural numbers x satisfyingn<x<n+1. 


6°. If né€N andnF1, then (n— 1) € Nand (n — 1) is the immediate predecessor 
of n in N; that is, if n € N, there are no natural numbers x satisfying n — 1 < 
x<n. 


We now prove one more property of the set of natural numbers. 
7°. Inany nonempty subset of the set of natural numbers there is a minimal element. 
Proof Let M CN. If 1 € M, then min M = 1, since Vn EN (1 <n). 


Now suppose | ¢ M, that is, 1 ¢ E = N\M. The set E must contain a natu- 
ral number n such that all natural numbers not larger than n belong to E, but 
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(n+ 1) € M. If there were no such n, the set E C N, which contains 1, would 
contain along with each of its elements n, the number (7 + 1) also; by the prin- 
ciple of induction, it would therefore equal N. But the latter is impossible, since 
N\E=MFZ@. 

The number (1 + 1) so found must be the smallest element of M, since there are 
no natural numbers between n and n + 1, as we have seen. 


2.2.2 Rational and Irrational Numbers 


a. The Integers 


Definition 3 The union of the set of natural numbers, the set of negatives of natural 
numbers, and zero is called the set of integers and is denoted Z. 


Since, as has already been proved, addition and multiplication of natural numbers 
do not take us outside N, it follows that these same operations on integers do not 
lead outside of Z. 


Proof Indeed, if m,n € Z, either one of these numbers is zero, and then the sum 
m +n equals the other number, so that (m +n) € Z and m-n=0€ Z, or both 
numbers are non-zero. In the latter case, either m,n € N and then (m+n) ENCZ 
and (m-n) € NC Z, or (—m), (—n) € N and then m -n = ((—1)m)((—1)n) € N or 
(—m),n € N and then (—m -n) EN, that is, m-n € Z, or, finally, m, —n € N and 
then (—m -n) € N and once again m-n € Z. 


Thus Z is an Abelian group with respect to addition. With respect to multiplica- 
tion Z is not a group, nor is Z\0, since the reciprocals of the integers are not in Z 
(except the reciprocals of | and —1). 


Proof Indeed, if m € Z and m 4 0,1, then assuming first that m € N, we have 
0 <1 <~m, and, sincem-m7!=1> 0, we must have 0 < ml<1 (see the con- 
sequences of the order axioms in the previous subsection). Thus m~! ¢ Z. The case 
when m is a negative integer different from —1 reduces immediately to the one 
already considered. 


When k =m-n7! € Z for two integers m,n € Z, that is, when m = k-n for some 
k € Z, we say that m is divisible by n or a multiple of n, or that n is a divisor of m. 

The divisibility of integers reduces immediately via suitable sign changes, that is, 
through multiplication by —1 when necessary, to the divisibility of the correspond- 
ing natural numbers. In this context it is studied in number theory. 

We recall without proof the so-called fundamental theorem of arithmetic, which 
we shall use in studying certain examples. 

A number p EN, p $1, is prime if it has no divisors in N except 1 and p. 
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The fundamental theorem of arithmetic Each natural number admits a represen- 
tation as a product 


n= Pi1-** Pk» 


where pj,..., Pk are prime numbers. This representation is unique except for the 
order of the factors. 


Numbers m,n € Z are said to be relatively prime if they have no common divi- 
sors except | and -1. 

It follows in particular from this theorem that if the product m - n of relatively 
prime numbers m and n is divisible by a prime p, then one of the two numbers is 
also divisible by p. 


b. The Rational Numbers 
Definition 4 Numbers of the form m-n7!, where m,n € Z, are called rational. 


We denote the set of rational numbers by Q. 

Thus, the ordered pair (m, n) of integers defines the rational number g =m -n— 
ifn £0. 

The number g = m-n~~ can also be written as a quotient of m and n, that is, as 
a so-called rational fraction 7. 

The rules you learned in school for operating with rational numbers in terms of 
their representation as fractions follow immediately from the definition of a rational 
number and the axioms for real numbers. In particular, “the value of a fraction is 
unchanged when both numerator and denominator are multiplied by the same non- 
zero integer’, that is, the fractions te and " represent the same rational number. In 
fact, since (nk)(k~!n7!) = 1, that is (n- k)~! =k7!-n7—!, we have (mk)(nk)~! = 
(mk)(k7!n7!) =m-n7!, 

Thus the different ordered pairs (m,n) and (mk, nk) define the same rational 
number. Consequently, after suitable reductions, any rational number can be pre- 
sented as an ordered pair of relatively prime integers. 

On the other hand, if the pairs (1,1) and (m2,n2) define the same rational 
number, that is, 77 - Ay =m: figs then m 1n2 = m2n1, and if, for example, m, 
and n, are relatively prime, it follows from the corollary of the fundamental theorem 
of arithmetic mentioned above that no - i =m?- m,! =keZ. 

We have thus demonstrated that two ordered pairs (m 1,1) and (m2, n2) define 
the same rational number if and only if they are proportional. That is, there exists 
an integer k € Z such that, for example, m2 = km, and nz =kn,. 


1 


1 


>The notation Q comes from the first letter of the English word quotient, which in turn comes from 
the Latin quota, meaning the unit part of something, and quat, meaning how many. 
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c. The Irrational Numbers 
Definition 5 The real numbers that are not rational are called irrational. 


The classical example of an irrational real number is A/D: that is, the number 
s €R such that s > 0 and s* = 2. By the Pythagorean theorem, the irrationality of 
/2 is equivalent to the assertion that the diagonal and side of a square are incom- 
mensurable. 

Thus we begin by verifying that there exists a real number s € R whose square 
equals 2, and then that s ¢ Q. 


Proof Let X and Y be the sets of positive real numbers such that Vx € X (x* <2), 
Vy €¥ (2 <y’). Since 1 € X and 2 € Y, it follows that X and Y are nonempty sets. 

Further, since (x < y) < (x? < y*) for positive numbers x and y, every element 
of X is less than every element of Y. By the completeness axiom there exists s € R 
such that x <s < y forallx € X andallyeY. 

We shall show that s* = 2. 

If s? <2, then, for example, the number s + a which is larger than s, would 
have a square less than 2. Indeed, we know that | € X, so that 1? <5? <2, and 
0 < A:=2-—s7 <1. It follows that 


ar A PBN Ss A x A 4 
st—) =s°4+2-—4(—) <58°4+3-— <58°4+3-—=5°4+A=2. 

35 35 3s 35 
Consequently, (s + £) € X, which is inconsistent with the inequality x < s for all 
xex, ‘ 

If 2 < 5”, then the number s — £ a which is smaller than s, would have a 

square larger than 2. Indeed, we know that 2 € Y, so that 2 < sy? <2 or0<A:= 
s?—2<3and0< 4 < 1. Hence, 


BN 2 sy Be PAN og Ay as 
s——]) =s°—-2-—4+(—) >s°-3-—=5°-A=2, 
35 35 35 3s 


and we have now contradicted the fact that s is a lower bound of Y. 

Thus the only remaining possibility is that s* = 2. 

Let us show, finally, that s ¢ Q. Assume that s € Q and let “ be an irreducible 
representation of s. Then m* = 2-n?, so that m? is divisible by 2 and therefore m 
also is divisible by 2. But, if m = 2k, then 2k? =n”, and for the same reason, n must 
be divisible by 2. But this contradicts the assumed irreducibility of the fraction *. 


We have worked hard just now to prove that there exist irrational numbers. We 
shall soon see that in a certain sense nearly all real numbers are irrational. It will be 
shown that the cardinality of the set of irrational numbers is larger than that of the 
set of rational numbers and that in fact the former equals the cardinality of the set 
of real numbers. 
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Among the irrational numbers we make a further distinction between the so- 
called algebraic irrational numbers and the transcendental numbers. 
A real number is called algebraic if it is the root of an algebraic equation 


aox" + +++ + ap—1x + an =0 


with rational (or equivalently, integer) coefficients. 

Otherwise the number is called transcendental. 

We shall see that the cardinality of the set of algebraic numbers is the same as 
that of the set of rational numbers, while the cardinality of the set of transcendental 
numbers is the same as that of the set of real numbers. For that reason the difficulties 
involved in exhibiting specific transcendental numbers — more precisely, proving 
that a given number is transcendental — seem at first sight paradoxical and unnatural. 

For example, it was not proved until 1882 that the classical geometric number zr 
is transcendental,’ and one of the famous Hilbert* problems was to prove the tran- 
scendence of the number a’, where a is algebraic, (a > 0) A (a $ 1) and £ is an 
irrational algebraic number (for example, a = 2, 6 = /2). 


2.2.3 The Principle of Archimedes 


We now turn to the principle of Archimedes,° which is important in both its theoret- 
ical aspect and the application of numbers in measurement and computations. We 
shall prove it using the completeness axiom (more precisely, the least-upper-bound 
principle, which is equivalent to the completeness axiom). In other axiom systems 
for the real numbers this fundamental principle is frequently included in the list of 
axioms. 

We remark that the propositions that we have proved up to now about the natural 
numbers and the integers have made no use at all of the completeness axiom. As 


3The number equals the ratio of the circumference of a circle to its diameter in Euclidean geom- 
etry. That is the reason this number has been conventionally denoted since the eighteenth century, 
following Euler by zr, which is the initial letter of the Greek word wepiyé pia — periphery (cir- 
cumference). The transcendence of z was proved by the German mathematician F. Lindemann 
(1852-1939). It follows in particular from the transcendence of z that it is impossible to construct 
a line segment of length 2 with compass and straightedge (the problem of rectification of the cir- 
cle), and also that the ancient problem of squaring the circle cannot be solved with compass and 
straightedge. 


4D, Hilbert (1862-1943) — outstanding German mathematician who stated 23 problems from dif- 
ferent areas of mathematics at the 1900 International Congress of Mathematicians in Paris. These 
problems came to be known as the “Hilbert problems”. The problem mentioned here (Hilbert’s sev- 
enth problem) was given an affirmative answer in 1934 by the Soviet mathematician A.O. Gel’ fond 
(1906-1968) and the German mathematician T. Schneider (1911-1989). 


5 Archimedes (287-212 BCE) — brilliant Greek scholar, about whom Leibniz, one of the founders of 
analysis said, “When you study the works of Archimedes, you cease to be amazed by the achieve- 
ments of modern mathematicians.” 
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will be seen below, the principle of Archimedes essentially reflects the properties of 
the natural numbers and integers connected with completeness. We begin with these 
properties. 


1°. Any nonempty subset of natural numbers that is bounded from above contains 
a maximal element. 


Proof If E CN is the subset in question, then by the least-upper-bound lemma, 
d!sup E = s € R. By definition of the least upper bound there is a natural number 
n € E satisfying the condition s— 1 <n <s. But then, n = max E£, since a natural 
number that is larger than n must be at leastn + 1,andn+1>5. 


Corollaries 2° The set of natural numbers is not bounded above. 


Proof Otherwise there would exist a maximal natural number. But n <n + 1. 


3°, Any nonempty subset of the integers that is bounded from above contains a 
maximal element. 


Proof The proof of 1° can be repeated verbatim, replacing N with Z. 


4°. Any nonempty subset of integers that is bounded below contains a minimal ele- 
ment. 


Proof One can, for example, repeat the proof of 1°, replacing N by Z and using the 

greatest-lower-bound principle instead of the least-upper-bound principle. 
Alternatively, one can pass to the negatives of the numbers (“change signs’) and 

use what has been proved in 3°. 


5°. The set of integers is unbounded above and unbounded below. 


Proof This follows from 3° and 4°, or directly from 2°. 


We can now state the principle of Archimedes. 


6°. (The principle of Archimedes). For any fixed positive number h and any real 
number x there exists a unique integer k such that (k — l)h<x <kh. 


Proof Since Z is not bounded above, the set {n € Z| 5 <n} is a nonempty subset 
of the integers that is bounded below. Then (see 4°) it contains a minimal element k, 
that is (k — 1) <x/h <k. Since h > 0, these inequalities are equivalent to those 
given in the statement of the principle of Archimedes. The uniqueness of k € Z 
satisfying these two inequalities follows from the uniqueness of the minimal element 
of a set of numbers (see Sect. 2.1.3). 


54 2 The Real Numbers 


And now some corollaries: 


7°. For any positive number ¢ there exists a natural number n such that 0 < ‘ <é. 


Proof By the principle of Archimedes there exists n € Z such that 1 < ¢-n. Since 
0 < 1 and 0 <e, we have 0 <n. Thusn € N and0 < ‘ <é. 


8°. If the number x €R is such that 0 < x and x < 1 for alln EN, then x =0. 


Proof The relation 0 < x is impossible by virtue of 7°. 


9°. For any numbers a,b € R such that a < b there is a rational number r € Q 
such thata <r <b. 


Proof Taking account of 7°, we choose n € N such that 0 < i < b—a. By the 
principle of Archimedes we can find a number m € Z such that wt <a< %. Then 
“ <b, since otherwise we would have mat <a<b<%, from which it would 


follow that + > b—a. Thusr = €Qanda<@ <b. 


10°. For any number x €R there exists a unique integer k € Z such that k < x < 
k+1. 


Proof This follows immediately from the principle of Archimedes. 


The number & just mentioned is denoted [x] and is called the integer part of x. 
The quantity {x} := x — [x] is called the fractional part of x. Thus x = [x] + {x}, 
and {x} > 0. 


2.2.4 The Geometric Interpretation of the Set of Real Numbers 
and Computational Aspects of Operations with Real 
Numbers 


a. The Real Line 


In relation to real numbers we often use a descriptive geometric language connected 
with a fact that you know in general terms from school. By the axioms of geometry 
there is a one-to-one correspondence f : L — R between the points of a line L 
and the set R of real numbers. Moreover this correspondence is connected with 
the rigid motions of the line. To be specific, if T is a parallel translation of the 
line L along itself, there exists a number ¢ € R (depending only on T) such that 
f(T (x)) = f(x) +t for each point x € L. 

The number f(x) corresponding to a point x € L is called the coordinate of x. In 
view of the one-to-one nature of the mapping f : L — R, the coordinate of a point is 
often called simply a point. For example, instead of the phrase “let us take the point 
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whose coordinate is 1” we say “let us take the point 1”. Given the correspondence 
f :L— R, we call the line L the coordinate axis or the number axis or the real line. 
Because f is bijective, the set R itself is also often called the real line and its points 
are called points of the real line. 

As noted above, the bijective mapping f : L— R that defines coordinates on L 
has the property that under a parallel translation T the coordinates of the images 
of points of the line L differ from the coordinates of the points themselves by a 
number ¢ € R, the same for every point. For this reason f is determined completely 
by specifying the point that is to have coordinate 0 and the point that is to have 
coordinate |, or more briefly, by the point 0, called the origin, and the point |. The 
closed interval determined by these points is called the unit interval. The direction 
determined by the ray with origin at 0 containing | is called the positive direction 
and a motion in that direction (from 0 to 1) is called a motion from left to right. In 
accordance with this convention, | lies to the right of 0 and 0 to the left of 1. 

Under a parallel translation T that moves the origin xo to the point x1 = T (xo) 
with coordinate 1, the coordinates of the images of all points are one unit larger 
than those of their pre-images, and therefore we locate the point x2 = T(x;) with 
coordinate 2, the point x3 = T(x2) with coordinate 3,..., and the point x,+1 = 
T (x) with coordinate n + 1, as well as the point x_; = T~'(xo) with coordinate 
—1,..., the point x_,_-1 = TO! (x_n) with coordinate —n — 1. In this way we obtain 
all points with integer coordinates m € Z. 

Knowing how to double, triple, ... the unit interval, we can use Thales’ theorem 
to partition this interval into n congruent subintervals. By taking the subinterval 
having an endpoint at the origin, we find that the coordinate of its other end, which 
we denote by x, satisfies the equation n - x = 1, that is, x = i. From this we find all 
points with rational coordinates * € Q. 

But there still remain points of LL, since we know there are intervals in- 
commensurable with the unit interval. Each such point, like every other point of the 
line, divides the line into two rays, on each of which there are points with integer 
or rational coordinates. (This is a consequence of the original geometric principle 
of Archimedes.) Thus a point produces a partition, or, as it is called, a cut of Q 
into two nonempty sets X and Y corresponding to the rational points (points with 
rational coordinates) on the left-hand and right-hand rays. By the axiom of com- 
pleteness, there is a number c that separates X and Y, that is, x <c < y for all 
x € X andall ye Y. Since X UY =Q, it follows that sup X = s =i = inf Y. For 
otherwise, s <i and there would be a rational number between s and i lying neither 
in X nor in Y. Thus s =i =c. This uniquely determined number c is assigned to 
the corresponding point of the line. 

The assignment of coordinates to points of the line just described provides a 
visualizable model for both the order relation in R (hence the term “linear ordering”) 
and for the axiom of completeness or continuity in R, which in geometric language 
means that there are no “holes” in the line L, which would separate it into two pieces 
having no points in common. (Such a separation could only come about by use of 
some point of the line L.) 

We shall not go into further detail about the construction of the mapping f :L— 
R, since we shall invoke the geometric interpretation of the set of real numbers only 
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for the sake of visualizability and perhaps to bring into play the reader’s very useful 
geometric intuition. As for the formal proofs, just as before, they will rely either on 
the collection of facts we have obtained from the axioms for the real numbers or 
directly on the axioms themselves. 

Geometric language, however, will be used constantly. 

We now introduce the following notation and terminology for the number sets 
listed below: 


Ja, bl := {x €R|a <x <b} 1s the open interval ab; 

[a, b] := {x €R|a <x <b} is the closed interval ab; 

Ja, b] := {x € R| a <x <D} is the half-open interval ab containing D; 
[a, bl := {x € R| a <x <)} is the half-open interval ab containing a. 


Definition 6 Open, closed, and half-open intervals are called numerical intervals 
or simply intervals. The numbers determining an interval are called its endpoints. 


The quantity b—a is called the length of the interval ab. If I is an interval, we 
shall denote its length by ||. (The origin of this notation will soon become clear.) 


The sets 


Ja, tool:={xeRla<x}, ]-o0, b[:={x eR] x <d} 


[a,toof:={xeR]a<x},  ]-co, bl :={x eR] x <)d} 


and ]—oo, +o0[ := R are conventionally called unbounded intervals or infinite in- 
tervals. 

In accordance with this use of the symbols +00 (read “plus infinity”) and —oo 
(read “minus infinity”) it is customary to denote the fact that the numerical set X is 
not bounded above (resp. below), by writing sup X = +00 (inf X = —oo). 


Definition 7 An open interval containing the point x € R will be called a neighbor- 
hood of this point. 


In particular, when 6 > 0, the open interval Jx — 6,x + d[ is called the 6- 
neighborhood of x. Its length is 25. 

The distance between points x, y € R is measured by the length of the interval 
having them as endpoints. 

So as not to have to investigate which of the points is “left” and which is “right”, 
that is, whether x < y or y < x and whether the length is y — x or x — y, we can use 
the useful function 


x when x > 0, 
|x| = 4 0 when x = 0, 
—x whenx <0, 


which is called the modulus or absolute value of the number. 
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Definition 8 The distance between x, y € R is the quantity |x — y|. 


The distance is nonnegative and equals zero only when the points x and y are 
the same. The distance from x to y is the same as the distance from y to x, since 
|x — y| = |y — x|. Finally, if z € R, then |x — y| < |x — z| + |z — y|. That is, the 
so-called triangle inequality holds. 

The triangle inequality follows from a property of the absolute value that is also 
called the triangle inequality (since it can be obtained from the preceding triangle 
inequality by setting z = 0 and replacing y by —y). To be specific, the inequality 


Ix + yl Slax] +lyl 


holds for any numbers x and y, and equality holds only when the numbers x and y 
are both negative or both positive. 


Proof 0 <x andO<y,thenO<x+y, |x+y|=x-+y, |x| =x, and |y| = y, so 
that equality holds in this case. 

If x <0 and y <0, thenx+y <0, |x+yl|=-(+ y)=—-—x — y, |x| = —-x, 
|y| = —y, and again we have equality. 

Now suppose one of the numbers is negative and the other positive, for example, 
x <O<y. Theneitherx <x+y <Oor0<x+y < y. Inthe first case |x + y| < |x|, 
and in the second case |x + y| < |y|, so that in both cases |x + y| < |x| +|y]. 


Using the principle of induction, one can verify that 
er tee teaal S lei tet bel, 


and equality holds if and only if the numbers x1, ...,x, are all nonnegative or all 
nonpositive. 

The number ate is often called the midpoint or center of the interval with end- 
points a and b, since it is equidistant from the endpoints of the interval. 

In particular, a point x € R is the center of its 6-neighborhood |x — 6, x + 6[ and 
all points of the 6-neighborhood lie at a distance from x less than 6. 


b. Defining a Number by Successive Approximations 


In measuring a real physical quantity, we obtain a number that, as a rule, changes 
when the measurement is repeated, especially if one changes either the method of 
making the measurement or the instrument used. Thus the result of measurement is 
usually an approximate value of the quantity being sought. The quality or precision 
of a measurement is characterized, for example, by the magnitude of the possible 
discrepancy between the true value of the quantity and the value obtained for it 
by measurement. When this is done, it may happen that we can never exhibit the 
exact value of the quantity (if it exists theoretically). Taking a more constructive 
position, however, we may (or should) consider that we know the desired quantity 
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completely if we can measure it with any preassigned precision. Taking this position 
is tantamount to identifying the number with a sequence® of more and more precise 
approximations by numbers obtained from measurement. But every measurement 
is a finite set of comparisons with some standard or with a part of the standard 
commensurable with it, so that the result of the measurement will necessarily be 
expressed in terms of natural numbers, integers, or, more generally, rational num- 
bers. Hence theoretically the whole set of real numbers can be described in terms of 
sequences of rational numbers by constructing, after due analysis, a mathematical 
copy or, better expressed, a model of what people do with numbers who have no 
notion of their axiomatic description. The latter add and multiply the approximate 
values rather than the values being measured, which are unknown to them. (To be 
sure, they do not always know how to say what relation the result of these operations 
has to the result that would be obtained if the computations were carried out with 
the exact values. We shall discuss this question below.) 

Having identified a number with a sequence of approximations to it, we should 
then, for example, add the sequences of approximate values when we wish to add 
two numbers. The new sequence thus obtained must be regarded as a new number, 
called the sum of the first two. But is it a number? The subtlety of the question 
resides in the fact that not every randomly constructed sequence is the sequence 
of arbitrarily precise approximations to some quantity. That is, one still has to learn 
how to determine from the sequence itself whether it represents some number or not. 
Another question that arises in the attempt to make a mathematical copy of opera- 
tions with approximate numbers is that different sequences may be approximating 
sequences for the same quantity. The relation between sequences of approximations 
defining a number and the numbers themselves is approximately the same as that 
between a point on a map and an arrow on the map indicating the point. The arrow 
determines the point, but the point determines only the tip of the arrow, and does not 
exclude the use of a different arrow that may happen to be more convenient. 

A precise description of these problems was given by Cauchy,’ who carried out 
the entire program of constructing a model of the real numbers, which we have only 
sketched. One may hope that after you study the theory of limits you will be able to 
repeat these constructions independently of Cauchy. 

What has been said up to now, of course, makes no claim to mathematical rigor. 
The purpose of this informal digression has been to direct the reader’s attention to 
the theoretical possibility that more than one natural model of the real numbers may 
exist. I have also tried to give a picture of the relation of numbers to the world around 
us and to clarify the fundamental role of natural and rational numbers. Finally, I 
wished to show that approximate computations are both natural and necessary. 


®If n is the number of the measurement and x, the result of that measurement, the correspondence 
Nt Xn is simply a function f : N— R of a natural-number argument, that is, by definition a se- 
quence (in this case a sequence of numbers). Section 3.1 is devoted to a detailed study of numerical 
sequences. 


7A. Cauchy (1789-1857) — French mathematician, one of the most active creators of the language 
of mathematics and the machinery of classical analysis. 
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The next part of the present section is devoted to simple but important estimates 
of the errors that arise in arithmetic operations on approximate quantities. These 
estimates will be used below and are of independent interest. 

We now give precise statements. 


Definition 9 If x is the exact value of a quantity and x a known approximation to 
the quantity, the numbers 
A(X) := |x —x| 
and 
A(x) 
|x| 
are called respectively the absolute and relative error of approximation by x. The 
relative error is not defined when x = 0. 


8(%) = 


Since the value x is unknown, the values of A(x) and 6(x) are also unknown. 
However, one usually knows some upper bounds A(x) < A and 8(x) < 6 for these 
quantities. In this case we say that the absolute or relative error does not exceed A or 
6 respectively. In practice we need to deal only with estimates for the errors, so that 
the quantities A and 6 themselves are often called the absolute and relative errors. 
But we shall not do this. 

The notation x = x + A means that x -A<x<x+A. 

For example, 


gravitational constant G = (6.672598 + 0.00085) x 10~!! N- m?/kg?, 
speed of light in vacuo c = 299792 458 m/s (exactly), 

Planck’s constant h = (6.6260755 + 0.0000040) x 10-34 J-s, 
charge of anelectron e = (1.60217733 + 0.00000049) x 10-9 ¢, 
rest mass of an electron me = (9.1093897 + 0.0000054) x 1077! kg. 


The main indicator of the precision of a measurement is the relative error in 
approximation, usually expressed as a percent. 
Thus in the examples just given the relative errors are at most (in order): 


13x 107°; 0; 6x1077; 311078; 6x 1077 
or, as percents of the measured values, 
13x 107° %: 0%: 6x107°%; 31x 10°°%: 6x 107> %. 


We now estimate the errors that arise in arithmetic operations with approximate 
quantities. 


Proposition /f 


|x — x| = A(X), ly— yl = AG), 
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then 


AE+5):=|A@+y)-E+H|< A+ AG), (2.1) 
Ay) :=|x-y—X- FS [XA + lVIAG@) + 4@)-A); 2.2) 


if, in addition, 


y#0, 540 and =O a, 
y 
then 
a(=)=|* Ae BED SIE. a (2.3) 
y y 2 7 1—4(y) 
Proof Letx =x +a and y= y+ B. Then 
A+ 9) =|@+y)—-4@+H)| =let Bl < lal +18] = AG) + AH), 
A(E- 9) = [xy —¥-F] =| +0)(F + B)-¥-5| = 
= |*B + ya + oB| < |X||B] + |¥|lo| + leap] = 
= |xX|A(y) + |¥|AG@) + AG) - ACY), 
a(=) = aia (da 
y y y yy 
_ | @+OI-O+)x / 1 [<u re a 
5 1+ B/y|~ ‘a 1—6(y) 
_ IAM +IyIAG@) 1 
i 1—8(3) 


These estimates for the absolute errors imply the following estimates for the 
relative errors: 


d(x+y)< eOnew) (2.1’) 
|x + y| 

d(X + ¥) <5(X) + 5(F) + 6(9) - (5), (2.2’) 
x (x) + 6(5) 

5 2.3! 

()s 1-36) — 


In practice, when working with sufficiently good approximations, we have A(x) - 
A(y) © 0, 6(x) - d(y) © O, and 1 — d(y) © 1, so that one can use the following 
simplified and useful, but formally incorrect, versions of formulas (2.2), (2.3), (2.2’), 
and (2.3): 


A(x - y) S$ |X|AY) + ly] AG), 
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a(=) Z AG) + zal 
y 3 


d(x y) <S(X) + d(y), 


x 7 s 
(=) < 8(%) + 8(5). 


Formulas (2.3) and (2.3’) show that it is necessary to avoid dividing by a number 
that is near zero and also to avoid using rather crude approximations in which y or 
1 — 8(¥) is small in absolute value. 

Formula (2.1’) warns against adding approximate quantities if they are close to 
each other in absolute value but opposite in sign, since then |x + y| is close to zero. 

In all these cases, the errors may increase sharply. 

For example, suppose your height has been measured twice by some device, 
and the precision of the measurement is +0.5 cm. Suppose a sheet of paper was 
placed under your feet before the second measurement. It may nevertheless happen 
that the results of the measurement are as follows: H; = (200 + 0.5) cm and H2 = 
(199.8 + 0.5) cm respectively. 

It does not make sense to try to find the thickness of the paper in the form of the 
difference Hz — H,, from which it would follow only that the thickness of the paper 
is not larger than 0.8 cm. That would of course be a crude reflection (if indeed one 
could even call it a “reflection’’) of the true situation. 

However, it is worthwhile to consider another more hopeful computational effect 
through which comparatively precise measurements can be carried out with crude 
devices. For example, if the device just used for measuring your height was used to 
measure the thickness of 1000 sheets of the same paper, and the result was (20 + 
0.5) cm, then the thickness of one sheet of paper is (0.02 + 0.0005) cm, which is 
(0.2 + 0.005) mm, as follows from formula (2.1). 

That is, with an absolute error not larger than 0.005 mm, the thickness of one 
sheet is 0.2 mm. The relative error in this measurement is at most 0.025 or 2.5 %. 

This idea can be developed and has been proposed, for example, as a way of 
detecting a weak periodic signal amid the larger random static usually called white 
noise. 


c. The Positional Computation System 


It was stated above that every real number can be presented as a sequence of ratio- 
nal approximations. We now recall a method, which is important when it comes to 
computation, for constructing in a uniform way a sequence of such rational approx- 
imations for every real number. This method leads to the positional computation 
system. 


Lemma /f a number q > | is fixed, then for every positive number x € R there 
exists a unique integer k € Z such that 


qe <x< gk. 
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Proof We first verify that the set of numbers of the form q*, k € N, is not bounded 
above. If it were, it would have a least upper bound s, and by definition of the least 
upper bound, there would be a natural number m € N such that 7 <q” <s. But 


then s < q’*!, so that s could not be an upper bound of the set. 

Since | < q, it follows that g” <q” when m <n for all m,n € Z. Hence we have 
also shown that for every real number c € R there exists a natural number N € N 
such that c <q” foralln > N. 

It follows that for any ¢ > 0 there exists M € N such that a < é for all natural 
numbers m > M. 

Indeed, it suffices to set c = i and N = M; then : <q™ whenm> M. 

Thus the set of integers m € Z satisfying the inequality x < q” for x > 0 is 
bounded below. It therefore has a minimal element k, which obviously will be the 
one we are seeking, since, for this integer, g'—! < x < q*. 

The uniqueness of such an integer k follows from the fact that if m,n € Z and, 
for example, m <n, then m <n — 1. Hence if q > 1, then g” < gt, 

Indeed, it can be seen from this remark that the inequalities g’”~! < x <q” and 
gq"! <x <q", which imply g”~! <x <q", are incompatible if m £n. 


We shall use this lemma in the following construction. Fix q > 1 and take an 
arbitrary positive number x € R. By the lemma we find a unique number p € Z 
such that 


G exea™. (2.4) 


Definition 10 The number p satisfying (2.4) is called the order of x in the base q 
or (when gq is fixed) simply the order of x. 


By the principle of Archimedes, we find a unique natural number ay € N such 
that 
apg? <x <apq?+q?. (2.5) 


Taking (2.4) into account, one can assert that a, € {1,...,q — 1}. 

All of the subsequent steps in our construction will repeat the step we are about 
to take, starting from relation (2.5). 

It follows from relation (2.5) and the principle of Archimedes that there exists a 
unique number @,_—; € {0, 1,...,q@ — 1} such that 


apg? + apg?! <x <a@pqPht+ apg?! aught, (2.6) 
If we have made n such steps, obtaining the relation 


apq? + apg?! Teel Ap—ng? ” Ss 


<x < apg? +p—1g? | t+ tap ng?” +q?™, 
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then by the principle of Archimedes there exists a unique number @p_»—1 € 
{0, 1,...,q — 1} such that 


—n —n—l| 
apg? +---+ap—ng? + op—n—1q? < 
- —n-1 =a 
Sx < apg? +---+ap—ng? "+ apy —n—1q? a +q? ars 


Thus we have exhibited an algorithm by means of which a sequence of numbers 
Op, Qp—1,--+,M@p—n,-.. from the set {0,1,...,¢q — 1} is placed in correspondence 
with the positive number x. Less formally, we have constructed a sequence of ratio- 
nal numbers of the special form 


ln = apg? +-+-+opng? ", (2.7) 
and such that 


Ty <X <n t rat (2.8) 

In other words, we construct better and better approximations from below 
and from above to the number x using the special sequence (2.7). The symbol 
Qp...Ap—n... 18 a code for the entire sequence {r,}. To recover the sequence {7} 
from this symbol it is necessary to indicate the value of p, the order of x. 

For p > 0 it is customary to place a period or comma after ag; for p < 0, the 
convention is to place | p| zeros left of a» and a period or comma right of the leftmost 
zero (we recall that a, 4 0). 

For example, when qg = 10, 


123.45 :=1x 10*+2~x 10!+3x10°+4~x 107! +5 x 107”, 
0.00123 := 1 x 1077 +2 x 10°47 +3 x 107: 


and when g = 2, 
1000.001 := 1-23 + 1-273, 


Thus the value of a digit in the symbol a)...a@p)—y... depends on the position it 
occupies relative to the period or comma. 

With this convention, the symbol a, ...a@o... makes it possible to recover the 
whole sequence of approximations. 

It can be seen by inequalities (2.8) (verify this!) that different sequences {r,,} and 
{r/,}, and therefore different symbols a p---dg... and Oy a a ..., correspond to 
different numbers x and x’. 

We now answer the question whether some real number x € R corresponds to 


every symbol a, ...aq.... The answer turns out to be negative. 
We remark that by virtue of the algorithm just described for obtaining the num- 
bers a&p—n € {0, 1,...,q — 1} successively, it cannot happen that all these numbers 


from some point on are equal to g — 1. 
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Indeed, if 
Pn = pq? +--+ peg? +g — Dgh 1 + +g — Dgh™ 


for all n > k, that is, 


1 1 


ra =k} Timp nap’ (2.9) 


then by (2.8) we have 
rk + 
q 
Then for any n > k 


1 
O<rpet+ qe x< qa’ 


which, as we know from 8° above, is impossible. 
It is also useful to note that if at least one of the numbers a@p—,-1,... , @p—n is 
less than g — 1, then instead of (2.9) we can write 


1 
Tn <Tk timp — GaP 


or, what is the same 


1 1 
et gap Et Gag (2.10) 
We can now prove that any symbol a, ...a@9..., composed of the numbers ax € 
{0, 1, ...,q—1}, and in which there are numbers different from g — 1 with arbitrarily 


large indices, corresponds to some number x > 0. 

Indeed, from the symbol ap ...@p—n... let us construct the sequence {r,} of the 
form (2.7). By virtue of the relations r9 <7, <7, <---, taking account of (2.9) and 
(2.10), we have 


1 1 
eT arr (2.11) 


ToS] See Se Se St 


= gp 

The strict inequalities in this last relation should be understood as follows: ev- 
ery element of the left-hand sequence is less than every element of the right-hand 
sequence. This follows from (2.10). 

If we now take x = sup, ey ln(= infpen(Ta + q "-P))), then the sequence {ry} 
will satisfy conditions (2.7) and (2.8), that is, the symbol ap .. .a@p—n ... corresponds 
to the number x € R. 

Thus, we have established a one-to-one correspondence between the positive 
numbers x € R and symbols of the form ap...ao,... if p> 0 or 0,0...0a,... 

——— 


| p| zeros 
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if p < 0. The symbol assigned to x is called the g-ary representation of x; the num- 
bers that occur in the symbol are called its digits, and the position of a digit relative 
to the period is called its rank. 

We agree to assign to a number x < 0 the symbol for the positive number —x, 
prefixed by a negative sign. Finally, we assign the symbol 0.0...0... to the num- 
ber 0. 

In this way we have constructed the positional q-ary system of writing real num- 
bers. 

The most useful systems are the decimal system (in common use) and for tech- 
nical reasons the binary system (in electronic computers). Less common, but also 
used in some parts of computer engineering are the ternary and octal systems. 

Formulas (2.7) and (2.8) show that if only a finite number of digits are retained 
in the g-ary expression of x (or, if we wish, we may say that the others are replaced 
with zeros), then the absolute error of the resulting approximation (2.7) for x does 
not exceed one unit in the last rank retained. 

This observation makes it possible to use the formulas obtained in Paragraph b 
to estimate the errors that arise when doing arithmetic operations on numbers as a 
result of replacing the exact numbers by the corresponding approximate values of 
the form (2.7). 

This last remark also has a certain theoretical value. To be specific, if we identify 
a real number x with its g-ary expression, as was suggested in Paragraph b, once 
we have learned to perform arithmetic operations directly on the g-ary symbols, we 
will have constructed a new model of the real numbers, seemingly of greater value 
from the computational point of view. 

The main problems that need to be solved in this direction are the following: 

To two q-ary symbols it is necessary to assign a new symbol representing their 
sum. It will of course be constructed one step at a time. To be specific, by adding 
more and more precise rational approximations of the original numbers, we shall 
obtain rational approximations corresponding to their sum. Using the remark made 
above, one can show that as the precision of the approximations of the terms in- 
creases, we shall obtain more and more q-ary digits of the sum, which will then not 
vary under subsequent improvements in the approximation. 

This same problem needs to be solved with respect to multiplication. 

Another, less constructive, route for passing from rational numbers to all real 
numbers is due to Dedekind. 

Dedekind identifies a real number with a cut in the set Q of rational numbers, 
that is, a partition of Q into two disjoint sets A and B such thata <b forallae A 
and all b € B. Under this approach to real numbers our axiom of completeness 
(continuity) becomes a well-known theorem of Dedekind. For that reason the axiom 
of completeness in the form we have given it is sometimes called Dedekind’s axiom. 

To summarize, in the present section we have exhibited the most important 
classes of numbers. We have shown the fundamental role played by the natural 
and rational numbers. It has been shown how the basic properties of these numbers 
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follow from the axiom system® we have adopted. We have given a picture of various 
models of the set of real numbers. We have discussed the computational aspects of 
the theory of real numbers: estimates of the errors arising during arithmetical oper- 
ations with approximate magnitudes, and the g-ary positional computation system. 


2.2.5 Problems and Exercises 


1. Using the principle of induction, show that 


a) the sum x; +---+x, of real numbers is defined independently of the insertion 
of parentheses to specify the order of addition; 

b) the same is true of the product x1 --- xy; 

C) [x1 +--+ +4n| S ler] +--+ [als 

d) |x1-+-Xn| = |x1|--- Pals 

e) (m,nEN)A(m <n)) > ((n—m) EN); 

f) d+)" > 1+ nx for x > —1 andn EN, equality holding only when n = 1 
or x = 0 (Bernoulli’s inequality); 

g) a+b)" =a" t Fab + an 2p? 4... 4+ EB"! +b" (New- 
ton’s binomial formula). 


2. a) Verify that Z and Q are inductive sets. 
b) Give examples of inductive sets different from N, Z, Q, and R. 


3. Show that an inductive set is not bounded above. 
4. a) Show that an inductive set is infinite (that is, equipotent with one of its subsets 
different from itself). 

b) The set E, = {x € N | x <n} is finite. (We denote card E,, by n.) 


5. (The Euclidean algorithm) a) Let m,n € N and m > n. Their greatest common 
divisor (gcd(m,n) = d € N) can be found in a finite number of steps using the 
following algorithm of Euclid involving successive divisions with remainder. 


m=qn+r (<n), 
n=qor,t+r2 (2 <r), 
r=q3r2+r3 (73 <1r2), 


rk-1 = Oki" + 02~ 


Then d = rx. 


8Tt was stated by Hilbert in almost the form given above at the turn of the twentieth century. See for 
example Hilbert, D. Foundations of Geometry, Chap. Il, §13. (Translated from the second edition 
of Grundlagen der Geometrie, La Salle, Hlinois: Open Court Press, 1971. This section was based 
on Hilbert’s article “Uber den Zahlbegriff” in Jahresbericht der deutschen Mathematikervereini- 
gung 8 (1900).) 
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b) If d = gcd(m, n), one can choose numbers p,q € Z such that pm + qn =d; 
in particular, if m and n are relatively prime, then pm + gn = 1. 


6. Try to give your own proof of the fundamental theorem of arithmetic (Para- 
graph a in Sect. 2.2.2). 

7. Show that if the product m - n of natural numbers is divisible by a prime p, that 
is, m-n=p-k, where k €N, then either m or n is divisible by p. 

8. It follows from the fundamental theorem of arithmetic that the set of prime num- 
bers is infinite. 

9, Show that if the natural number n is not of the form k”, where k,m € N, then 
the equation x” =n has no rational roots. 

10. Show that the expression of a rational number in any g-ary computation system 
is periodic, that is, starting from some rank it consists of periodically repeating 
groups of digits. 

11. Let us call an irrational number w € R well approximated by rational numbers 
if for any natural numbers n, N € N there exists a rational number : such that 


1 


a) Construct an example of a well-approximated irrational number. 
b) Prove that a well-approximated irrational number cannot be algebraic, that is, 
it is transcendental (Liouville’s theorem).? 


12. Knowing that 7 :=m- n—! by definition, where m € Z and n € N, derive the 
“rules” for addition, multiplication, and division of fractions, and also the condition 
for two fractions to be equal. 
13. Verify that the rational numbers Q satisfy all the axioms for real numbers except 
the axiom of completeness. 
14. Adopting the geometric model of the set of real numbers (the real line), show 
how to construct the numbers a + b, a — b, ab, and ; in this model. 
15. a) Illustrate the axiom of completeness on the real line. 

b) Prove that the least-upper-bound principle is equivalent to the axiom of com- 
pleteness. 


16. a) If AC BCR, then supA < sup B and inf A > inf B. 

b) LettRD XASGPandRDYF7.If x <y forall x € X andall y € Y, then 
X is bounded above, Y is bounded below, and sup X < inf Y. 

c) If the sets X, Y in b) are such that X UY = R, then sup X = inf Y. 

d) If X and Y are the sets defined in c), then either X has a maximal element 
or Y has a minimal element. (Dedekind’ s theorem.) 

e) (Continuation.) Show that Dedekind’s theorem is equivalent to the axiom of 
completeness. 


17. Let A+ B be the set of numbers of the form a+ b and A- B the set of numbers 
of the form a-b, where a € A C R and b € B CR. Determine whether it is always 
true that 


°J. Liouville (1809-1882) — French mathematician, who wrote on complex analysis, geometry, 
differential equations, number theory, and mechanics. 
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a) sup(A + B) = sup A + sup B, 
b) sup(A- B) = supA- sup B. 


18. Let —A be the set of numbers of the form —a, where a € A C R. Show that 
sup(—A) = —inf A. 
19. a) Show that for n € N and a > 0 the equation x” = a has a positive root 
(denoted 2/a or a!/"), 

b) Verify that fora > 0,b >0,andn,meéeN 


Vab = %/a-b and va= meg, 


c) (any” = (a”)in =: qn/n and ai/”" .qi/m =q!/ntl/m_ 
d) (am/")—1 = (a—tym/n =: qum/n. 
e) Show that for all r},r2 € Q 


Y 
a .q@ =a"? and (Qt) Sa, 


20. a) Show that the inclusion relation is a partial ordering relation on sets (but not 
a linear ordering!). 

b) Let A, B, and C be sets such that A CC, BCC, A\B#¢@,and B\AF GS. 
We introduce a partial ordering into this triple of sets as in a). Exhibit the maximal 
and minimal elements of the set {A, B, C}. (Pay attention to the non-uniqueness!) 


21. a) Show that, just like the set Q of rational numbers, the set Q(./7) of numbers 
of the form a + b./n, where a, b € Q and n is a fixed natural number that is not the 
square of any integer, is an ordered set satisfying the principle of Archimedes but 
not the axiom of completeness. 

b) Determine which axioms for the real numbers do not hold for Q(,/7) if the 
standard arithmetic operations are retained in Q(,/n) but order is defined by the 
rule (a+bJ/n <a’+b'J/n):= ((b <b) Vv (b= D/A (a <.’))). Will QC /n) now 
satisfy the principle of Archimedes? 

c) Order the set P[x] of polynomials with rational or real coefficients by spec- 
ifying that 


P(x) = an + ayx +--+ +amx™" >0, if am > 0. 
d) Show that the set Q(x) of rational fractions 


dg + ayx +++ + anx™ 
~ Dp Bix eb bax” 


m,n 


with coefficients in Q or R becomes an ordered field, but not an Archimedean or- 
dered field, when the order relation Ry», > 0 is defined to mean a,b, > 0 and 
the usual arithmetic operations are introduced. This means that the principle of 
Archimedes cannot be deduced from the other axioms for R without using the axiom 
of completeness. 
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22. Letn € N and n > 1. In the set E, = {0,1,..., — 1} we define the sum and 
product of two elements as the remainders when the usual sum and product in R are 
divided by n. With these operations defined on it, the set E,, is denoted Z,. 


a) Show that if n is not a prime number, then there are nonzero numbers m, k 
in Z, such that m -k = 0. (Such numbers are called zero divisors.) This means that 
in Z, the equation a- b =c- b does not imply that a = c, even when b 4 0. 

b) Show that if p is prime, then there are no zero divisors in Z, and Zp is a 
field. 

c) Show that, no matter what the prime p, Z, cannot be ordered in a way 
consistent with the arithmetic operations on it. 


23. Show that if R and R’ are two models of the set of real numbers and f : R > R’ 


is a mapping such that f(x + y) = f(x) + fQ) and f(x- y) = f(x): fQ) for any 
x,y ER, then 


a) f(0)=0'; 

b) fC) =U if f(x) 40’, which we shall henceforth assume; 

c) f(m) =m’ where m € Z and m’' € Z’, and the mapping f : Z > Z’ is injec- 
tive and preserves the order. 

d) f(4) = 7, where m,neZ,n0, m',n’ €Z',n' 40, f(m) =n’, 


f(a) =n’. Thus f :Q— Q is a bijection that preserves order. 
e) f :R—R’ isa bijective mapping that preserves order. 


24. On the basis of the preceding exercise and the axiom of completeness, show 
that the axiom system for the set of real numbers determines it completely up to an 
isomorphism (method of realizing it), that is, if R and R’ are two sets satisfying these 
axioms, then there exists a one-to-one correspondence f : R — R’ that preserves 
the arithmetic operations and the order: f(x + y) = f(x) + fQ), f®-y) = f()- 
f(y), and (x < y) & (f(@) < fy). 


25. A number x is represented on a computer as 


hog 

n 

e=4q?> = 
n=l 


qd n 


where p is the order of x and M = ee a is the mantissa of the number x G < 
M <1). 
Now a computer works only with a certain range of numbers: for g = 2 usually 
|p| < 64, and k = 35. Evaluate this range in the decimal system. 
26. a) Write out the (6 x 6) multiplication table for multiplication in base 6. 

b) Using the result of a), multiply “columnwise” in the base-6 system 


(532)6 
x 


(145)6¢6 


and check your work by repeating the computation in the decimal system. 
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c) Perform the “long” division 


(1301)6|(25)6 


and check your work by repeating the computation in the decimal system. 
d) Perform the “columnwise” addition 


(4052)¢ 
x 


(3125)¢6 


27. Write (100) jo in the binary and ternary systems. 
28. a) Show that along with the unique representation of an integer as 


(Qn Qn] ..-0)3, 


where a; € {0, 1, 2}, it can also be written as 


(Bn Bn-1 tee Bo)3; 


where 6 € {—1, 0, 1}. 

b) What is the largest number of coins from which one can detect a counter- 
feit in three weighings with a pan balance, if it is known in advance only that the 
counterfeit coin differs in weight from the other coins? 


29. What is the smallest number of questions to be answered “yes” or “no” that one 
must pose in order to be sure of determining a 7-digit telephone number? 
30. a) How many different numbers can one define using 20 decimal digits (for 
example, two ranks with 10 possible digits in each)? Answer the same question for 
the binary system. Which system does a comparison of the results favor in terms of 
efficiency? 

b) Evaluate the number of different numbers one can write, having at one’s 
disposal n digits of a g-ary system. (Answer: q”/7.) 

c) Draw the graph of the function f(x) = x”/* over the set of natural-number 
values of the argument and compare the efficiency of the different systems of com- 
putation. 


2.3 Basic Lemmas Connected with the Completeness of the Real 
Numbers 


In this section we shall establish some simple useful principles, each of which could 
have been used as the axiom of completeness in our construction of the real num- 
bers.!° 


10See Problem 4 at the end of this section. 
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We have called these principles basic lemmas in view of their extensive applica- 
tion in the proofs of a wide variety of theorems in analysis. 


2.3.1 The Nested Interval Lemma (Cauchy—Cantor Principle) 


Definition 1 A function f :N — X of a natural-number argument is called a se- 
quence or, more fully, a sequence of elements of X. 


The value f(n) of the function f corresponding to the number n € N is often 
denoted x, and called the mth term of the sequence. 


Definition 2 Let X,, X2,..., Xn,... be a sequence of sets. If X; D X2D---D 
Xn D---, thatis X, D Xn+1 for all n € N, we say the sequence is nested. 


Lemma (Cauchy—Cantor) For any nested sequence 1 DInD---DI,nD--- of 
closed intervals, there exists a point c € R belonging to all of these intervals. 

If in addition it is known that for any ¢ > 0 there is an interval Ix whose length 
|, | is less than e, then c is the unique point common to all the intervals. 


Proof We begin by remarking that for any two closed intervals J, = [am, bm] and 
Tn = [an, by] of the sequence we have a, < b,. For otherwise we would have a, < 
bn < Am < bm, that is, the intervals J,, and J, would be mutually disjoint, while one 
of them (the one with the larger index) is contained in the other. 

Thus the numerical sets A = {a | m € N} and B = {b, | n € N} satisfy the hy- 
potheses of the axiom of completeness, by virtue of which there is a number c € R 
such that am <c < by for all a» € A and all b, € B. In particular, a, <c < b, for 
all n € N. But that means that the point c belongs to all the intervals /,,. 

Now let c; and cz be two points having this property. If they are different, say 
C| <2, then for any n € N we have a, < cy < cz < by, and therefore 0 < cz — cy < 
bn — Gn, So that the length of an interval in the sequence cannot be less than c2 — c1. 
Hence if there are intervals of arbitrarily small length in the sequence, their common 
point is unique. 


2.3.2 The Finite Covering Lemma (Borel-Lebesgue Principle, 
or Heine—Borel Theorem) 


Definition 3 A system S = {X} of sets X is said to cover a set Y if Y CUyes X, 
(that is, if every element y € Y belongs to at least one of the sets X in the system S). 


A subset of a set S = {X} that is a system of sets will be called a subsystem of S. 
Thus a subsystem of a system of sets is itself a system of sets of the same type. 
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Lemma (Borel—Lebesgue!!) Every system of open intervals covering a closed in- 
terval contains a finite subsystem that covers the closed interval. 


Proof Let S = {U} be a system of open intervals U that cover the closed interval 
[a, b] = J). If the interval 7; could not be covered by a finite set of intervals of the 
system S, then, dividing J; into two halves, we would find that at least one of the 
two halves, which we denote by /2, does not admit a finite covering. We now repeat 
this procedure with the interval /2, and so on. 

In this way a nested sequence Jj D 12 D---D I, D--- of closed intervals arises, 
none of which admit a covering by a finite subsystem of S. Since the length of the 
interval J, is |J,| = |1|-27”, the sequence {J,} contains intervals of arbitrarily 
small length (see the lemma in Paragraph c of Sect. 2.2.4). But the nested interval 
theorem implies that there exists a point c belonging to all of the intervals J,,n € N. 
Since c € [; = [a, b] there exists an open interval Ja, B[ = U € S containing c, that 
is, a <c < B. Let e = min{c — a, B — c}. In the sequence just constructed, we 
find an interval /,, such that |J,| < ¢. Since c € J, and |J,| < €, we conclude that 
I, C U = a, B[. But this contradicts the fact that the interval J, cannot be covered 
by a finite set of intervals from the system. 


2.3.3 The Limit Point Lemma (Bolzano—Weierstrass Principle) 


We recall that we have defined a neighborhood of a point x € R to be an open 
interval containing the point and the 6-neighborhood about x to be the open interval 
jx —6,x+6[. 


Definition 4 A point p € R is a limit point of the set X C R if every neighborhood 
of the point contains an infinite subset of X. 


This condition is obviously equivalent to the assertion that every neighborhood 
of p contains at least one point of X different from p itself. (Verify this!) 

We now give some examples. 

If X= {i € R|n €N}, the only limit point of X is the point 0 € R. 

For an open interval Ja, b[ every point of the closed interval [a, b] is a limit point, 
and there are no others. 

For the set Q of rational numbers every point of R is a limit point; for, as we 
know, every open interval of the real numbers contains rational numbers. 


1. Borel (1871-1956) and H. Lebesgue (1875-1941) — well-known French mathematicians who 
worked in the theory of functions. 
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Lemma (Bolzano—Weierstrass!*) Every bounded infinite set of real numbers has at 
least one limit point. 


Proof Let X be the given subset of R. It follows from the definition of boundedness 
that X is contained in some closed interval J C R. We shall show that at least one 
point of J is a limit point of X. 

If such were not the case, then each point x € J would have a neighborhood 
U(x) containing either no points of X or at most a finite number. The totality of 
such neighborhoods {U(x)} constructed for the points x € J forms a covering of 
I by open intervals U(x). By the finite covering lemma we can extract a system 
U(x), ..., U(x») of open intervals that cover J. But, since X C J, this same system 
also covers X. However, there are only finitely many points of X in U(x;), and 
hence only finitely many in their union. That is, X is a finite set. This contradiction 
completes the proof. 


2.3.4 Problems and Exercises 


1. Show that 


a) if J is any system of nested closed intervals, then 
sup{a ER|[a, ble} =a <f=inf{beR| [a,b] €7} 


and 


[«, B1= (| (a,b): 


[a,blel 


b) if J is a system of nested open intervals Ja, b[ the intersection ‘al a,ble 71a, b[ 
may happen to be empty. 
Hint :Jan, bnf=]0, +1. 


on 
2. Show that 


a) from a system of closed intervals covering a closed interval it is not always 
possible to choose a finite subsystem covering the interval; 

b) from a system of open intervals covering an open interval it is not always 
possible to choose a finite subsystem covering the interval; 

c) from a system of closed intervals covering an open interval it is not always 
possible to choose a finite subsystem covering the interval. 


!2B. Bolzano (1781-1848) — Czech mathematician and philosopher. K. Weierstrass (1815-1897) 
— German mathematician who devoted a great deal of attention to the logical foundations of math- 
ematical analysis. 
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3. Show that if we take only the set Q of rational numbers instead of the complete 
set R of real numbers, taking a closed interval, open interval, and neighborhood of 
a point r € Q to mean respectively the corresponding subsets of Q, then none of the 
three lemmas proved above remains true. 

4. Show that we obtain an axiom system equivalent to the one already given if we 
take as the axiom of completeness 


a) the Bolzano—Weierstrass principle 
or 

b) the Borel—Lebesgue principle (Heine—Borel theorem). 

Hint: The principle of Archimedes and the axiom of completeness in the earlier 
form both follow from a). 

c) Replacing the axiom of completeness by the Cauchy—Cantor principle leads 
to a system of axioms that becomes equivalent to the original system if we also 
postulate the principle of Archimedes. (See Problem 21 in Sect. 2.2.2.) 


2.4 Countable and Uncountable Sets 


We now make a small addition to the information about sets that was provided in 
Chap. |. This addition will be useful below. 


2.4.1 Countable Sets 


Definition 1 A set X is countable if it is equipollent with the set N of natural num- 
bers, that is, card X = card N. 


Proposition 


a) An infinite subset of a countable set is countable. 
b) The union of the sets of a finite or countable system of countable sets is a count- 
able set. 


Proof a) It suffices to verify that every infinite subset E of N is equipollent with N. 
We construct the needed bijective mapping f : N > E as follows. There is a 
minimal element of E; := E, which we assign to the number | € N and denote 
e, € E. The set E is infinite, and therefore FE, := E,\e; is nonempty. We assign 
the minimal element of E2 to the number 2 and call it e2 € Ex. We then consider 
E3 := E\{e, eo}, and so forth. Since E is an infinite set, this construction cannot 
terminate at any finite step with index n € N. As follows from the principle of in- 
duction, we assign in this way a certain number e, € E to eachn € N. The mapping 
f :N-— E is obviously injective. 
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It remains to verify that it is surjective, that is, f(N) = E. Let e € E. The set 
{n € N |n < e} is finite, and hence the subset of it {n € E | n < e} is also finite. Let 
k be the number of elements in the latter set. Then by construction e = ex. 

b) If X1,..., Xn,... 18 a countable system of sets and each set X= com ee 
xi, ...} 1s itself countable, then since the cardinality of the set X = U, eN Xn, which 
consists of the elements x/, where m,n €N, is not less than the cardinality of each 
of the sets X;,, it follows that X is an infinite set. The element x/, € Xm can be 
identified with the pair (m, n) of natural numbers that defines it. Then the cardinality 
of X cannot be greater than the cardinality of the set of all such ordered pairs. But the 
mapping f : N x NN given by the formula (m,n) (mta— ania) +m, as 
one can easily verify, is bijective. (It has a visualizable meaning: we are enumerating 
the points of the plane with coordinates (m,n) by successively passing from points 
of one diagonal on which m + n is constant to the points of the next such diagonal, 
where the sum is one larger.) 

Thus the set of ordered pairs (m,n) of natural numbers is countable. But then 
card X < cardN, and since X is an infinite set we conclude on the basis of a) that 
card X = cardN. 


It follows from the proposition just proved that any subset of a countable set is 
either finite or countable. If it is known that a set is either finite or countable, we say 
it is at most countable. (An equivalent expression is card X < cardN.) 

We can now assert, in particular, that the union of an at most countable family of 
at most countable sets is at most countable. 


Corollaries 


1) cardZ = cardN. 
2) cardN? = cardN. 

(This result means that the direct product of countable sets is countable.) 
3) cardQ=cardN, that is, the set of rational numbers is countable. 


Proof A rational number ™ is defined by an ordered pair (m,n) of integers. Two 
pairs (m,n) and (m’,n’) define the same rational number if and only if they are 
proportional. Thus, choosing as the unique pair representing each rational num- 
ber the pair (m,n) with the smallest possible positive integer denominator n € N, 
we find that the set Q is equipollent to some infinite subset of the set Z x Z. But 
card Z? = cardN and hence card Q = cardN. 


4) The set of algebraic numbers is countable. 


Proof We remark first of all that the equality Q x Q = card N implies, by induction, 
that card Q* = card N for every k € N. 

An element r € Qé is an ordered set (r1,..., 7%) of k rational numbers. 

An algebraic equation of degree k with rational coefficients can be written in the 
reduced form x* + ryx*—1 4... + 7% = 0, where the leading coefficient is 1. Thus 
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there are as many different algebraic equations of degree k as there are different 
ordered sets (r1,..., 7%) of rational numbers, that is, a countable set. 

The algebraic equations with rational coefficients (of arbitrary degree) also form 
a countable set, being a countable union (over degrees) of countable sets. Each such 
equation has only a finite number of roots. Hence the set of algebraic numbers is at 
most countable. But it is infinite, and hence countable. 


2.4.2 The Cardinality of the Continuum 


13 


Definition 2 The set R of real numbers is also called the number continuum,’ and 


its cardinality the cardinality of the continuum. 
Theorem (Cantor) card N < cardR. 


This theorem asserts that the infinite set IR has cardinality greater than that of the 
infinite set N. 


Proof We shall show that even the closed interval [0, 1] is an uncountable set. 

Assume that it is countable, that is, can be written as a sequence x), X2,..., 
Xn,..-. Take the point x; and on the interval [0, 1] = Jo fix a closed interval of 
positive length /; not containing the point x;. In the interval J; construct an interval 
Ty not containing x2. If the interval /,, has been constructed, then, since |J,,| > 0, we 
construct in it an interval J,41 so that x41 ¢ In41 and |J,41| > 0. By the nested set 
lemma, there is a point c belonging to all of the intervals Jo, 1,..., In, .... But this 
point of the closed interval Jy = [0, 1] by construction cannot be any point of the 
sequence x1,%2,...,Xpn,---- 


Corollaries 


1) QS¥R, and so irrational numbers exist. 
2) There exist transcendental numbers, since the set of algebraic numbers is 
countable. 


(After solving Exercise 3 below, the reader will no doubt wish to reinterpret this 
last proposition, stating it as follows: Algebraic numbers are occasionally encoun- 
tered among the real numbers.) 

At the very dawn of set theory the question arose whether there exist sets of 
cardinality between countable sets and sets having cardinality of the continuum, 
and the conjecture was made, known as the continuum hypothesis, that there are no 
intermediate cardinalities. 


13 From the Latin continuum, meaning continuous, or solid. 
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The question turned out to involve the deepest parts of the foundations of mathe- 
matics. It was definitively answered in 1963 by the American mathematician P. Co- 
hen. Cohen proved that the continuum hypothesis is undecidable by showing that 
neither the hypothesis nor its negation contradicts the standard axiom system of set 
theory, so that the continuum hypothesis can be neither proved nor disproved within 
that axiom system. This situation is very similar to the way in which Euclid’s fifth 
postulate on parallel lines is independent of the other axioms of geometry. 


2.4.3 Problems and Exercises 


1. Show that the set of real numbers has the same cardinality as the points of the 
interval ]—1, 1[. 
2. Give an explicit one-to-one correspondence between 


a) the points of two open intervals; 

b) the points of two closed intervals; 

c) the points of a closed interval and the points of an open interval; 
d) the points of the closed interval [0, 1] and the set R. 


3. Show that 


a) every infinite set contains a countable subset; 

b) the set of even integers has the same cardinality as the set of all natural num- 
bers; 

c) the union of an infinite set and an at most countable set has the same cardi- 
nality as the original infinite set; 

d) the set of irrational numbers has the cardinality of the continuum; 

e) the set of transcendental numbers has the cardinality of the continuum. 


4. Show that 


a) the set of increasing sequences of natural numbers {n1 < nz < ---} has the 
same cardinality as the set of fractions of the form 0.a)a2...; 
b) the set of all subsets of a countable set has cardinality of the continuum. 


5. Show that 


a) the set P(X) of subsets of a set X has the same cardinality as the set of all 
functions on X with values 0, 1, that is, the set of mappings f : X — {0, 1}; 

b) fora finite set X of n elements, card P(X) = 2”; 

c) taking account of the results of Exercises 4b) and 5a), one can write 
card P(X) = 2°4X | and, in particular, card P(N) = 2°"4N = cardR; 

d) for any set X 


ard X 
gear ; 


card X < in particular, n<2" foranyneN. 


Hint: See Cantor’s theorem in Sect. 1.4.1. 
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6. Let X;,..., X, bea finite system of finite sets. Show that 
m 
=(0 x) = Y card Xi, _ > card(X;, 1 Xi.) + 
i=1 iy 1, <i2 


ai os card(X;, Xj, Xi3) — ++ + 


11 <i2<13 
+ (=1)"—! card(X1 + Xn), 


the summation extending over all sets of indices from 1 to m satisfying the inequal- 
ities under the summation signs. 

7. On the closed interval [0, 1] C R describe the sets of numbers x € [0, 1] whose 
ternary representation x = 0.a;a203..., a; € {0, 1, 2}, has the property: 


a) a #1; 
b) @ FDAMFD; 
c) Vi € N (a; ¥ 1) (the Cantor set). 


8. (Continuation of Exercise 7.) Show that 


a) the set of numbers x € [0, 1] whose ternary representation does not contain | 
has the same cardinality as the set of all numbers whose binary representation has 
the form 0.6) B2...; 

b) the Cantor set has the same cardinality as the closed interval [0, 1]. 


Chapter 3 
Limits 


In discussing the various aspects of the concept of a real number we remarked in 
particular that in measuring real physical quantities we obtain sequences of approx- 
imate values with which one must then work. 

Such a state of affairs immediately raises at least the following three questions: 


1) What relation does the sequence of approximations so obtained have to the quan- 
tity being measured? We have in mind the mathematical aspect of the question, 
that is, we wish to obtain an exact expression of what is meant in general by 
the expression “sequence of approximate values” and the extent to which such a 
sequence describes the value of the quantity. Is the description unambiguous, or 
can the same sequence correspond to different values of the measured quantity? 

2) How are operations on the approximate values connected with the same opera- 
tions on the exact values, and how can we characterize the operations that can 
legitimately be carried out by replacing the exact values with approximate ones? 

3) How can one determine from a sequence of numbers whether it can be a se- 
quence of arbitrarily precise approximations of the values of some quantity? 


The answer to these and related questions is provided by the concept of the limit 
of a function, one of the fundamental concepts of analysis. 

We begin our discussion of the theory of limits by considering the limit of a 
function of a natural-number argument (a sequence), in view of the fundamental 
role played by these functions, as already explained, and also because all the basic 
facts of the theory of limits can actually be clearly seen in this simplest situation. 


3.1 The Limit of a Sequence 
3.1.1 Definitions and Examples 


We recall the following definition. 


Definition 1 A function f : N— X whose domain of definition is the set of natural 
numbers is called a sequence. 


© Springer-Verlag Berlin Heidelberg 2015 79 
V.A. Zorich, Mathematical Analysis I, Universitext, 
DOI 10.1007/978-3-662-48792-1_3 


80 3 Limits 


The values f(n) of the function f are called the terms of the sequence. It is cus- 
tomary to denote them by a symbol for an element of the set into which the mapping 
goes, endowing each symbol with the corresponding index of the argument. Thus, 
Xn := f(n). In this connection the sequence itself is denoted {x,}, and also written 
aS X1,X2,...,Xp,...- Itis called a sequence in X or a sequence of elements of X. 

The element x, is called the nth term of the sequence. 

Throughout the next few sections we shall be considering only sequences f : 
N = R of real numbers. 


Definition 2 A number A € R is called the limit of the numerical sequence {x,} if 
for every neighborhood V(A) of A there exists an index N (depending on V(A)) 
such that all terms of the sequence having index larger than N belong to the neigh- 
borhood V (A). 


We shall give an expression in formal logic for this definition below, but we first 
point out another common formulation of the definition of the limit of a sequence. 

A number A € R is called the limit of the sequence {xy} if for every ¢ > 0 there 
exists an index N such that |x, — A| < ¢ foralln > N. 

The equivalence of these two statements is easy to verify (verify it!) if we remark 
that any neighborhood V (A) of A contains some ¢-neighborhood of the point A. 

The second formulation of the definition of a limit means that no matter what 
precision ¢ > 0 we have prescribed, there exists an index N such that the absolute 
error in approximating the number A by terms of the sequence {x,} is less than ¢ as 
soonasn > N. 

We now write these formulations of the definition of a limit in the language of 
symbolic logic, agreeing that the expression “limy-—.oo X, = A” is to mean that A is 
the limit of the sequence {x,,}. Thus 


( lim x, = A) :=VV(A) JN EN Vn > N (xp € V(A)) 


n— oo 


and respectively 


( lim in =A) -= Ve >04N ENVn>N (|xy — Al <e). 
noo 
Definition 3 If limy—.o0 x, = A, we say that the sequence {x,} converges to A or 
tends to A and write x, > A asn— oo. 

A sequence having a limit is said to be convergent. A sequence that does not have 
a limit is said to be divergent. 


Let us consider some examples. 


1 


Example 1 Vimy 6 4 = 0, since |4 —O0|=;, <ewhenn>N= [4].! 


We recall that [x] is the integer part of the number x. (See Corollaries 7° and 10° of Sect. 2.2.) 
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~ ah 


Example 2 limy-.o0 "++ = 1, since |"*4 — 1| 


; <eifn> [4]. 


Example 3 limpoo(1 ++ <*) = 1, since |(1 + <*) — 1) = 4 <e whenn > [4]. 
sinn 


Example 4 limy—oo “2% = 0, since |“2" — 0| < } <e forn > [4]. 


Example 5 limy—+oo 7 =Oif|g¢|> 1. 


Let us verify this last assertion using the definition of the limit. As was shown in 
Paragraph c of Sect. 2.2.4, for every ¢ > O there exists N € N such that iY <€. 


Since |g| > 1, we shall have la O| < ar < air <eéforn > N, and the condition 
in the definition of the limit is satisfied. 


Example 6 The sequence 1, 2, 34, 5,6, ae whose nth term is x, = acy" 
n EN, is divergent. 


Proof Indeed, if A were the limit of this sequence, then, as follows from the defini- 
tion of limit, any neighborhood of A would contain all but a finite number of terms 
of the sequence. 

A number A ¥ 0 cannot be the limit of this sequence; for if ¢ = a > 0, all 
the terms of the sequence of the form pent for which pent < lA lie outside the 
e-neighborhood of A. 

But the number 0 also cannot be the limit, since, for example, there are infinitely 
many terms of the sequence lying outside the 1-neighborhood of 0. 


Example 7 One can verify similarly that the sequence 1, —1, +1, —1,..., for which 
Xn = (—1)", has no limit. 


3.1.2 Properties of the Limit of a Sequence 


a. General Properties 


We assign to this group the properties possessed not only by numerical sequences, 
but by other kinds of sequences as well, as we shall see below, although at present 
we shall study these properties only for numerical sequences. 

A sequence assuming only one value will be called a constant sequence. 


Definition 4 If there exists a number A and an index N such that x, = A for all 
n > N, the sequence {x,} will be called ultimately constant. 


Definition 5 A sequence {x,,} is bounded if there exists M such that |x,| < M for 
allnéeN. 
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Theorem 1 a) An ultimately constant sequence converges. 

b) Any neighborhood of the limit of a sequence contains all but a finite number 
of terms of the sequence. 

c) A convergent sequence cannot have two different limits. 

d) A convergent sequence is bounded. 


Proof a) If x, = A for n > N, then for any neighborhood V(A) of A we have 
Xn € V(A) when n > N, that is, limy +o X, = A. 

b) This assertion follows immediately from the definition of a convergent se- 
quence. 

c) This is the most important part of the theorem. Let limp—+o%, = A, and 
limy-+o0%n = Ar. If A; # Az, we fix nonintersecting neighborhoods V(A}1) 
and V(A2) of A; and A». These neighborhoods might be, for example, the 6- 
neighborhoods of A; and A2 for 6 < 5|A 1 — A2|. By definition of limit we find 
indices N; and N> such that x, € V(A1) for all n > N, and x, € V(Az2) for all 
n > Np. But then for N = max{N, No} we have x, € V(A1) M V(A2). But this is 
impossible, since V(A1)M V(A2) = @. 

d) Let limy-+ 99 X, = A. Setting ¢ = | in the definition of a limit, we find N such 
that |x, — A| <1 for alln > N. Then forn > N we have |x,| < |A| + 1. If we now 
take M > max{|x1|,..-, |xn|, |Al + 1} we find that |x,| < M foralln EN. 


b. Passage to the Limit and the Arithmetic Operations 


Definition 6 If {x,} and {y,} are two numerical sequences, their sum, product, and 
quotient (in accordance with the general definition of sum, product, and quotient of 
functions) are the sequences 


{Gin ty}, {Grey}. {(=)}. 


Yn 


The quotient, of course, is defined only when y, 4 0 for all n EN. 


Theorem 2 Let {x,} and {y,} be numerical sequences. If limy—+o0Xn = A and 
limy— oo Yn = B, then 


a) limp—+oo(%n + Yn) =At B; 

b) limnsooGn + Yn) =A-B; 

c) limy+oo ee = 4, provided y, £0 (n=1,2,...) and B40. 
Proof As an exercise we use the estimates for the absolute errors that arise under 
arithmetic operations with approximate values of quantities, which we already know 
(Sect. 2.2.4). 

Set |A — xn | = A(Xn), |B — yn| = AQ). Then for case a) we have 


\(A +B)-—(Ont+ yn)| < A(xn) + ACn). 
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Suppose ¢ > 0 is given. Since limy_s 99 X, = A, there exists N’ such that A(x,) < 
é/2 for all n > N’. Similarly, since limy—+oo yy = B, there exists N” such that 
A(Qyn) < €/2 for alln > N”. Then for n > max{N’, N”} we shall have 


|(A+ B) — Gn + yn)| <2, 


which, by definition of limit, proves assertion a). 
b) We know that 


|(A = B) — Gn Yn)| S ln AQn) + L¥n|A Grn) + AGn) - AQ): 


Given ¢ > 0 find numbers N’ and N” such that 
! : é 
Vn > N A(x) < min} 1, ————— } ], 
3(|B| + 1) 


" : é 
Vn > N (400) <min| 1, =|). 


Then for n > N = max{N’, N”’} we shall have 


lxn| < |A| + A@n) < JA] +1, 
lyn] <|Bl + A(yn) < |B + 1, 


A(X) - A(yn) <min{ 1, ;| -min{ 1 ;| < . 


Hence for n > N we have 
€ 
A <(|A Oo ————— 
lXn|A(yn) (| |+1) 3A| +) 3 
lynlA@n) < (|B| +1): ——=— <= 
X. —— ie 
eee 3(BI+1) 3 
€ 


A(Xn)- A(yn) < 3 


and therefore |AB — xnyyn|<e forn> WN. 
c) We use the estimate 


F Xn = IxnlAQn) + lyn|AGn)| . 1 
By ya 1—8(yn)’ 
Ayn) 


where 5(y,) = Fi 
For a given € > 0 we find numbers N’ and N” such that 


/ e|B| 
Vn > N A(x) < min ae a ; 
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aw (ago Snel e- Be 
n> < min} —, ————_} ]. 
ae 4° 16(A|+) 
Then for n > max{N’, N’’} we shall have 
lXn| < |A| + AQn) < JA] +1, 


[Bl _ |B| 
Iynl > 1B] — AGn) > 1BI- > 


) ’ 
1 2 
Se Nori 
lyn| = |BI 
A B\/4 1 
0< 380) = On) _| |/ ae 
lynl |B\/2 2 
1 
L=30n) > 5: 
and therefore 
1 4 e+ B? € 
_=A A|l+1)- : =-, 
2 B 
DG Nien eal 
Yn |B| 8 4 
1 
1 —d(yn) 
and consequently 


<e whenn>N. 
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Remark The statement of the theorem admits another, less constructive method of 
proof that is probably known to the reader from the high-school course in the rudi- 
ments of analysis. We shall mention this method when we discuss the limit of an 
arbitrary function. But here, when considering the limit of a sequence, we wished 
to call attention to the way in which bounds on the errors in the result of an arith- 
metic operations can be used to set permissible bounds on the errors in the values of 


quantities on which an operation is carried out. 


c. Passage to the Limit and Inequalities 


Theorem 3 a) Let {x,} and {y,} be two convergent sequences with limy—+o0 Xn = A 
and limn-+o0 Yn = B. If A < B, then there exists an index N €N such that xn < yn 


foralln> WN. 
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b) Suppose the sequences {xy}, {y,}, and {Zn} are such that Xj < Yn < Zn for all 
n>NeN. If the sequences {x} and {Zn} both converge to the same limit, then the 
sequence {yy} also converges to that limit. 


Proof a) Choose a number C such that A < C < B. By definition of limit, we can 
find numbers N’ and N” such that |x, — A] < C — A forall n > N’ and |y, — B| < 
B—C foralln > N”. Then forn > N = max{N’, N”} we shall have x, < A+C— 
A=CH Bath Oh y,. 

b) Suppose limy—+ oo Xn = limy—+oo Zn = A. Given € > 0 choose N’ and N” such 
that A — ¢ <x, for alln > N’ and z, < A+ée foralln > N”. Then forn > N= 
max{N’, N’’} we shall have A —€ < xp < yn < Zn < A+é, which says |y, — Al <6, 
that is A = limp—+oo yn- 


Corollary Suppose limp—+oo Xn = A and limps Yn = B. If there exists N such 
that for alln > N we have 


a) Xn > Yn, then A> B; 
b) Xn = yn, then A= B, 
C) Xx, > B, then A> B; 
d) x, > B, then A> B. 


Proof Arguing by contradiction, we obtain the first two assertions immediately 
from part a) of the theorem. The third and fourth assertions are the special cases 
of the first two obtained when y, = B. 


It is worth noting that strict inequality may become equality in the limit. For 
example J > 0 for all n EN, yet limy_, 2 i =0. 


3.1.3 Questions Involving the Existence of the Limit of a Sequence 


a. The Cauchy Criterion 


Definition 7 A sequence {x,} is called a fundamental or Cauchy sequence? if for 
any € > 0 there exists an index N € N such that |x, — x» | < ¢ whenever n > N and 
m>wN., 


Theorem 4 (Cauchy’s convergence criterion) A numerical sequence converges if 
and only if it is a Cauchy sequence. 


Bolzano introduced Cauchy sequences in an attempt to prove, without having at his disposal a 
precise concept of a real number, that a fundamental sequence converges. Cauchy gave a proof, 
taking the nested interval principle, which was later justified by Cantor, as obvious. 
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Proof Suppose limy-+o5 X, = A. Given € > 0, we find an index N such that |x, — 
Al < a forn > N. Thenifm > N andn > N, we have |x» —Xn| < |X%m — A] +|xn - 
A| < 5 + 5 =e, and we have thus verified that the sequence is a Cauchy sequence. 

Now let {x,} be a fundamental sequence. Given ¢ > 0, we find an index N such 
that |xm — xx| < ; when m > N and k > N. Fixing m = N, we find that for any 
k>WN 


xy — 5 <xE<IN +5, 3.1) 


but since only a finite number of terms of the sequence have indices not larger 
than N, we have shown that a fundamental sequence is bounded. 

For n € N we now set dy := infgsn x, and by := supys, Xk- 

It is clear from these definitions that ad, < day41 < Dns 1 < Dy (since the greatest 
lower bound does not decrease and the least upper bound does not increase when we 
pass to a smaller set). By the nested interval principle, there is a point A common to 
all of the closed intervals [a,, by |. 

Since 


a <A<by 
for any n € N and 


An = inf xp < xR < sup xz = Dx 
k>n k>n 


for k > n, it follows that 
|A — xx| < bn — an. (3.2) 
But it follows from Eq. (3.1) that 


g , E 
xn — = < inf x, =ay < by =supxy <xN4+ = 
3 ~ kn k>n 3 


for n > N, and therefore 
2€ 
bn — dn < aoe (3.3) 
for n > m. Comparing Eqs. (3.2) and (3.3), we find that 


|A — xz| <6, 


for any k > N, and we have proved that limy_+o0 x4 = A. 


Example 8 The sequence (—1)” (n= 1, 2,...) has no limit, since it is not a Cauchy 
sequence. Even though this fact is obvious, we shall give a formal verification. The 
negation of the statement that {x,} is a Cauchy sequence is the following: 


de > 0VN EN An > N Am>N (|xm —xnl = 6), 
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that is, there exists ¢ > 0 such that for any N € N two numbers n, m larger than N 
exist for which |x» — xXy| > &. 

In our case it suffices to set ¢ = 1. Then for any N € N we shall have |xy+1 — 
xy42l=[1—(-D| =2>1=e. 


Example 9 Let 
xy=0.a1, x2=O.aja2, x3=O0.aja203, ..., xX, =O0.ajQa2...ay, 
be a sequence of finite binary fractions in which each successive fraction is obtained 


by adjoining a 0 or a | to its predecessor. We shall show that such a sequence always 
converges. Let m > n. Let us estimate the difference x», — Xn: 


On+1 a 

Xm — Xn| = aie a < 
1 1 Cam _ Gynt} 1 
7s ey i 1-5 Qn° 


Thus, given ¢ > 0, if we choose N so that ww < €, we obtain the estimate |x, — 


Xn| < + < ow < é for all m > n> N, which proves that the sequence {x,} is a 


Cauchy sequence. 


Example 10 Consider the sequence {x,}, where 


1 1 
Xn =ltatet—. 
2 n 
Since 
| | 1 Geena 1 1 1 
x —x = reiete >n- = . 
= i n+1 n+n 2n 2 


for all n € N, the Cauchy criterion implies immediately that this sequence does not 
have a limit. 


b. A Criterion for the Existence of the Limit of a Monotonic Sequence 


Definition 8 A sequence {x,} is increasing if xX) < Xn+1 for all n € N, nondecreas- 
ing if X) < Xn41 for alln EN, nonincreasing if xy > Xy+ for alln € N, and decreas- 


ing if X, > Xn+1 for all n € N. Sequences of these four types are called monotonic 
sequences. 


Definition 9 A sequence {x,} is bounded above if there exists a number M such 
that x, < M forallnéeN. 
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Theorem 5 (Weierstrass) In order for a nondecreasing sequence to have a limit it 
is necessary and sufficient that it be bounded above. 


Proof The fact that any convergent sequence is bounded was proved above under 
general properties of the limit of a sequence. For that reason only the sufficiency 
assertion is of interest. 

By hypothesis the set of values of the sequence {x,} is bounded above and hence 
has a least upper bound s = sup, ey Xn. 

By definition of the least upper bound, for every ¢ > O there exists an element 
xn € {xy} such that s — ¢ < xy <-s. Since the sequence {x,} is nondecreasing, we 
now find that s—¢ <xy <x, <s foralln > N. That is, |s — x,| =s — Xx, <e. 
Thus we have proved that lim,-, 99 x, = 5. 


Of course an analogous theorem can be stated and proved for a nonincreasing 
sequence that is bounded below. In this case limp. 99 X, = infypen Xy- 


Remark The boundedness from above (resp. below) of a nondecreasing (resp. non- 
increasing) sequence is obviously equivalent to the boundedness of that sequence. 
Let us consider some useful examples. 


n 
gn 


Example 11 limy-+oo r 


=Oifg>1. 


x, for n EN. Since limp_+o a = 


limn+oo(1 + $) = lity+oo(1 + 3): lits00 9 = 1-3 =} <1, there exists an 


n+l 
nq 


Proof Indeed, if x, = me then x,41 = 
index N such that ut <1 forn > N. Thus we shall have x41 <x, forn > N, so 
that the sequence will be monotonically decreasing from index N on. As one can 
see from the definition of a limit, a finite set of terms of a sequence has no effect the 
convergence of a sequence or its limit, so that it now suffices to find the limit of the 
sequence XN4+1 >xXN42>°°°. 

The terms of this sequence are positive, that is, the sequence is bounded below. 
Therefore it has a limit. 


Let x = limy_s 0 Xn. It now follows from the relation x,4) = net xn that 


. . n+1 . ; 1 
x= lim (%,41) = lim Xn) = lim - lim x, = —x, 
n—>oo n—>oo nq n—->oco nq n—>oo qd 


from which we find (1 — wx = 0, and sox =0. 


Corollary 1 
lim Yn=1. 


n> oo 
Proof By what was just proved, for a given ¢ > O there exists N € N such that 
1<n< (1+ 6)" for all n > N. Then for n > N we obtain 1 < 2/n < 1+ € and 
hence limy—.o9 ¢/n = 1. 
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Corollary 2 
lim */a=1 foranya>0. 
noo 


Proof Assume first that a > 1. For any ¢ > 0 there exists N € N such that 1 <a < 
(1+ )" for all n > N, and we then have 1 < 2/a <1+¢ foralln > N, which says 


limp+oo V/a= 1. 
For 0 <a < 1, we have | < i, and then 


1 


lim 2/a= lim a 
n—> oo noo 1 n 
- limn—oo 
Example 12 limy-+oo r = 0; here g is any real number, n €N, and n! :=1-2- 


Proof Yt q = 0, the assertion is obvious. Further, since IS |= lal" , it suffices to 
pee the assertion for g > 0. Reasoning as in Example 11, we remark that x,+1 = 
X,. Since the - of natural numbers is not bounded above, there exists an index 


FEE 
N such that 0 < wt <1 for alln > N. Then for n > N we shall have xn+41 < Xn, 


and since the terms of the sequence are positive, one can now guarantee that the 
limit limy-+o9 Xn = X exists. But then 


x= lim | Xn41 = lim Xn = lim - lim x, =0-x =0. 
n>oon + n>on + n—>oo 


c. The Number e 


Example 13 Let us prove that the limit limy—o9(1 + iy" exists. 

In this case the limit is a number denoted by the letter e, after Euler. This number 
is just as central to analysis as the number | to arithmetic or z to geometry. We shall 
revisit it many times for a wide variety of reasons. 

We begin by verifying the following inequality, sometimes called Jacob Ber- 
noulli’s inequality: 


(l+a)">1l+na forneNanda>-—l. 


Proof The assertion is true for n = 1. If it holds for n € N, then it must also hold for 
n+ 1, since we then have 


(l+a)"*!=(1+a)(1+a)" > (1+a)(1+na) = 
=14+(n+Da+tna?>14+(n4 la. 


3Jacob (James) Bernoulli (1654-1705) — Swiss mathematician, a member of the famous Bernoulli 
family of scholars. He was one of the founders of the calculus of variations and probability theory. 
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By the principle of induction the assertion is true for all n € N. 
Incidentally, the computation shows that strict inequality holds if a #0 and 
n>. 


We now show that the sequence y, = (1 + iyntl is decreasing. 


Proof Let n > 2. Using Bernoulli’s inequality, we find that 


yet. Cay, ne ee ae 
Ya (1+ Atl @?- 1 n+l 


n2—1 
ee n n ie n 4 
is n2—1 Pe et n/n+1. 


Since the terms of the sequence are positive, the limit limy_-.5.(1 + iyntl exists. 
But we then have 


1 n 1 n+1 1 -1 
lim (1+-) = lim (1+-) (1+-) = 
n—oo n noo n n 
1 n+1 1 1 n+l 
= lim (1+-) - lim T= lim (1+-) ; 
n—>oo n n—>oo | += noo n 
n 


Thus we make the following definition: 


Definition 10 


1 n 
e:= lim (1+-) : 
noo n 


d. Subsequences and Partial Limits of a Sequence 


Definition 11 If x1, x2,...,2%n,... is a sequence and ny <2 <-+-<mg <--- an 
increasing sequence of natural numbers, then the sequence xy,,Xn,,.--,Xnys +++ 18 
called a subsequence of the sequence {xp}. 


For example, the sequence 1,3,5,... of positive odd integers in their natural 
order is a subsequence of the sequence 1, 2,3,.... But the sequence 3, 1,5,7,9,... 
it not a subsequence of this sequence. 


Lemma 1 (Bolzano—Weierstrass) Every bounded sequence of real numbers con- 
tains a convergent subsequence. 
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Proof Let E be the set of values of the bounded sequence {x,}. If E is finite, there 
exists a point x € FE and a sequence nj <n <--- of indices such that x,, = X,. = 
--- =x. The subsequence {x,,} is constant and hence converges. 

If E is infinite, then by the Bolzano—Weierstrass principle it has a limit point x. 
Since x is a limit point of E, one can choose n; € N such that |x,, — x| < 1. If 


nx € N have been chosen so that |x,, — x| < ie then, because x is a limit point of 


: 1 
E, there exists ng41 € N such that np < n+) and IXnypyy —X| < Fat: 


Since limg-.o6 —_ O, the sequence X7,,%n5,...,Xn,,... SO constructed con- 
k 1 2 k 
verges to x. 


Definition 12 We shall write x, — ++oo and say that the sequence {x,} tends to 
positive infinity if for each number c there exists N € N such that x, > c for all 
n>QN. 


Let us write this and two analogous definitions in logical notation: 


(X%» > +00) := Wee RAN EN Vn > WN (c < Xn), 
(X%, > —0co) := Veo ERAN ENVn>N (x% <0), 
(X%,» > 00) :=VcoERANENVn>N (c< Ixnl). 


In the last two cases we say that the sequence {x,} tends to negative infinity and 
tends to infinity respectively. 

We remark that a sequence may be unbounded and yet not tend to positive infin- 
ity, negative infinity, or infinity. An example is x, =n)". 

Sequences that tend to infinity will not be considered convergent. 

It is easy to see that these definitions enable us to supplement Lemma |, stating 
it in a slightly different form. 


Lemma 2 From each sequence of real numbers one can extract either a convergent 
subsequence or a subsequence that tends to infinity. 


Proof The new case here occurs when the sequence {x,,} is not bounded. Then for 
each k € N we can choose nx € N such that |xp,| > k and ny < ny41. We then obtain 
a subsequence {xn,} that tends to infinity. 


Let {x;} be an arbitrary sequence of real numbers. If it is bounded below, one 
can consider the sequence iy = infy>, xx (which we have already encountered in 
proving the Cauchy convergence criterion). Since i, <in41 for any n €N, either 


the sequence {i,} has a finite limit limy—.o0 in =1, OF in > +00. 


Definition 13 The number / = limy-+o0 infg>n xx is called the inferior limit of the 
sequence {x,} and denoted lim,_, ., xz or lim infy_, 40 xx. If in + +00, it is said that 
the inferior limit of the sequence equals positive infinity, and we write lim, _, ., xx = 
+oo or lim infy_.o0 x, = +00. If the original sequence {x;} is not bounded below, 
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then we shall have i, = inf>, x, = —oo for all n. In that case we say that the 
inferior limit of the sequence equals negative infinity and write lim, _, ., x, = —0o 
or liminfx_s 99 Xp = —OO. 


Thus, taking account of all the possibilities just enumerated, we can now write 
down briefly the definition of the inferior limit of a sequence {x,}: 


lim x, := lim inf xz. 
ese n>ok>n 


Similarly by considering the sequence s, = sup;s, Xx, We arrive at the definition 
of the superior limit of the sequence {x;}: 


Definition 14 


lim x, := lim sup xx. 
k>oo noo k>n 


We now give several examples: 
Example 14 xz, = (—1)*,k EN: 


lim x, = lim inf x, = lim inf(— pk = jim n (— lIj=- 
k->00 n—>Owk>n n—>Owk>n 


im x, = lim supx, = jim, sup(— 1)* = lim 1=1. 
k-0o n>oo k>n Ck>n noo 


Example 15 xx = KODE keN: 


lim kK De = lim inf k@ ee = lim 0=0, 


eS ~~ n> 00 k>n ~~ n00 


lim k—- Dis = lim sup k{- 1s = lim n (+00) = +00. 


k->o0o "100 pon 


Example 16 xp =k, k EN: 


lim k= lim infk= lim n=+00, 
k>00 n>Ok>n noo 


lim k= lim supk = im (+00) = +00. 


k—>oo noo k>n 


= lim inf 


E365 n>Ok>n n> oo Far? ifn =2m 


(-1)* (—1)k —1 aa 
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__ (—1)* (-1)* 1 ifn =2m 
1 


k>00 «6k N>Ops, k noo} 1. ifn=2m+1 


Example 18 x, = —k*,keEN: 


lim (—k*) = lim_inf(—k’) = —oo. 


k->00 noo kan 
Example 19 xz = (—1)*k, k EN: 
lim (—1)*k = lim inf(—1)*k = lim (—00) = —o0, 
aes n>OOk>n noo 


k->0o 


lim (—1)*k = lim sup(—1)*k = lim (+00) = +00. 
k->0o NO k>n n> oo 


To explain the origin of the terms “superior” and “inferior” limit of a sequence, 
we make the following definition. 


Definition 15 A number (or the symbol —oo or +00) is called a partial limit of a 
sequence, if the sequence contains a subsequence converging to that number. 


Proposition 1 The inferior and superior limits of a bounded sequence are respec- 
tively the smallest and largest partial limits of the sequence.* 


Proof Let us prove this, for example, for the inferior limit i = lim,_, ,., xx. What 
we know about the sequence i, = infy>, xz is that it is nondecreasing and that 
limy—+oo in = 1 € R. For the numbers n € N, using the definition of the greatest 
lower bound, we choose by induction numbers k, € N such that ky < ky, and 
ik, < Xk < Ue + 1 (Taking i; we find k;; taking iz,+1 we find kz, etc.) Since 
limy+o0 in = limysoo(in + ‘) =i, we can assert, by properties of limits, that 
limy—oo Xk, = i. We have thus proved that i is a partial limit of the sequence {x;}. 
It is the smallest partial limit since for every ¢ > O there exists n € N such that 
i —€ <in, that isi — € <in = infksy xy < xx for any k >n. 

The inequality i — ¢ < x, for k > n means that no partial limit of the sequence 
can be less than i — ¢. But ¢ > 0 is arbitrary, and hence no partial limit can be less 
than i. 

The proof for the superior limit is of course analogous. 


We now remark that if a sequence is not bounded below, then one can select a 
subsequence of it tending to —oo. But in this case we also have lim,_, ,, x, = —©, 
and we can make the convention that the inferior limit is once again the smallest 
partial limit. The superior limit may be finite; if so, by what has been proved it 


4Here we are assuming the natural relations —oo < x < +oo between the symbols —oo, +00 and 
numbers x € R. 
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must be the largest partial limit. But it may also be infinite. If limy_, 45 x, = +00, 
then the sequence is also unbounded from above, and one can select a subsequence 
tending to +oo. Finally, if limgs00 Xk = —c, which is also possible, this means 
that sup; >, Xk = Sn —> —0o, that is, the sequence {x,} itself tends to —oo, since 
Sy > Xp. Similarly, if lim, ., x" = +00, then x, > +00. 

Taking account of what has just been said we deduce the following proposition. 


Proposition 1’ For any sequence, the inferior limit is the smallest of its partial 
limits and the superior limit is the largest of its partial limits. 


Corollary 3 A sequence has a limit or tends to negative or positive infinity if and 
only if its inferior and superior limits are the same. 


Proof The cases when lim, ,.x%% = limp_5o0X¢ = +00 or limy_, 46 %k = 
limg_s00 Xk = —O have been investigated above, and so we may assume that 
Timy_, 45 Xk = limgsoo Xe = A ER. Since in = infksn xK S Xn S SUPgsy Xk = Sn 
and by hypothesis limy—o0 in = limy—+oo Sn = A, we also have limy-o0 Xn = A by 
properties of limits. 


Corollary 4 A sequence converges if and only if every subsequence of it converges. 


Proof The inferior and superior limits of a subsequence lie between those of the 
sequence itself. If the sequence converges, its inferior and superior limits are the 
same, and so those of the subsequence must also be the same, proving that the 
subsequence converges. Moreover, the limit of the subsequence must be the same 
as that of the sequence itself. 

The converse assertion is obvious, since the subsequence can be chosen as the 
sequence itself. 


Corollary 5 The Bolzano—Weierstrass Lemma in its restricted and wider formula- 
tions follows from Propositions | and 1' respectively. 


Proof Indeed, if the sequence {x;} is bounded, then the points i = lim,_, ,, x, and 
5 = limg-soo xg are finite and, by what has been proved, are partial limits of the 
sequence. Only when i = s does the sequence have a unique limit point. When 
i <s there are at least two. 

If the sequence is unbounded on one side or the other, there exists a subsequence 
tending to the corresponding infinity. 


Concluding Remarks We have carried out all three points of the program outlined 
at the beginning of this section (and even gone beyond it in some ways). We have 
given a precise definition of the limit of a sequence, proved that the limit is unique, 
explained the connection between the limit operation and the structure of the set of 
real numbers, and obtained a criterion for convergence of a sequence. 

We now study a special type of sequence that is frequently encountered and very 
useful — a series. 
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3.1.4 Elementary Facts About Series 


a. The Sum of a Series and the Cauchy Criterion for Convergence of a Series 


Let {ay} be a sequence of real numbers. We recall that the sum dp + dp41 +--+ + 
dq, (p <q) is denoted by the symbol es ayn. We now wish to give a precise 
meaning to the expression a, + a2 +---+a,+---, which expresses the sum of all 
the terms of the sequence {ay}. 


Definition 16 The expression a; + a2 +--+: +a, +--- is denoted by the symbol 
ye dn and usually called a series or an infinite series (in order to emphasize its 
difference from the sum of a finite number of terms). 


Definition 17 The elements of the sequence {a,}, when regarded as elements of the 
series, are called the terms of the series. The element ay is called the nth term. 


Definition 18 The sum s, = ae 1 & is called the partial sum of the series, or, 
when one wishes to exhibit its index, the nth partial sum of the series.” 


Definition 19 If the sequence {s,} of partial sums of a series converges, we say the 
series is convergent. If the sequence {s,} does not have a limit, we say the series is 
divergent. 


Definition 20 The limit lim,_, 0 5) = s of the sequence of partial sums of the se- 
ries, if it exists, is called the sum of the series. 


It is in this sense that we shall henceforth understand the expression 


Since convergence of a series is equivalent to convergence of its sequence of 
partial sums {s,}, applying the Cauchy convergence criterion to the sequence {s,} 
yields the following theorem. 


Theorem 6 (The Cauchy convergence criterion for a series) The series aj +---+ 
dn + +++ converges if and only if for every ¢ > 0 there exists N € N such that the 
inequalities m >n > N imply |an +---+am| <6. 


Corollary 6 [f only a finite number of terms of a series are changed, the resulting 
new series will converge if the original series did and diverge if it diverged. 


5Thus we are actually defining a series to be an ordered pair ({ay}, {s,}) of sequences connected 
by the relation (s, = yo 4 ax) for alln EN. 


96 3 Limits 


Proof For the proof it suffices to assume that the number WN in the Cauchy conver- 
gence criterion is larger than the largest index among the terms that were altered. 


Corollary 7 A necessary condition for convergence of the series ay +-+-+an,+-:: 
is that the terms tend to zero as n —> OW, that is, it is necessary that limMp—+oo An = 0. 


Proof It suffices to set m = n in the Cauchy convergence criterion and use the defi- 
nition of the limit of a sequence. 


Here is another proof: a, = sy — Sn—1, and, given that limy—.oo Sn = 5, we have 
limps 00 An = limy-s 90 (Sn — Sp—1) = limps 99 Sn — Limp 99 Sp] = 8 — 5 = 0. 


Example 20 The series 1+ g + q* +---+ q"+->- is often called the geometric 
series. Let us investigate its convergence. 

Since |g”| = |q|", we have |g”| > 1 when |g| > 1, and in this case the necessary 
condition for convergence is not met. 

Now suppose |q| < 1. Then 


and limy-s 99 Sn = To since limy-+o0 gq” = O if |g| < 1. 

Thus the series )°-° , q"—' converges if and only if |g| < 1, and in that case its 
sum is i: 

—4q 

Example 21 The series 1 + 5 teeet i +--+ is called the harmonic series, since 
each term from the second on is the harmonic mean of the two terms on either side 
of it (see Exercise 6 at the end of this section). 

The terms of the series tend to zero, but the sequence of partial sums 


1 1 
Sai Lee oa 


as was shown in Example 10, diverges. This means that in this case s, — +00 as 
n> o. 
Thus the harmonic series diverges. 


Example 22 The series 1 —1+1—---+(—1)"t!+.-- diverges, as can be seen 
both from the sequence of partial sums 1, 0, 1, 0, ... and from the fact that the terms 
do not tend to zero. 

If we insert parentheses and consider the new series 


(-)+0-D+e-, 


whose terms are the sums enclosed in parentheses, this new series does converge, 
and its sum is obviously zero. 
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If we insert the parentheses in a different way and consider the series 
1+ (-14D4+(C14D4-", 


the result is a convergent series with sum 1. 
If we move all the terms that are equal to —1 in the original series two places to 
the right, we obtain the series 


1+1-1+4+1-14+1-.-.-., 
we can then, by inserting parentheses, arrive at the series 
G(+D+C14+)+Cl4+D4+--, 


whose sum equals 2. 


These observations show that the usual laws for dealing with finite sums can in 
general not be extended to series. 

There is nevertheless an important type of series that can be handled exactly like 
finite sums, as we shall see below. These are the so-called absolutely convergent 
series. They are the ones we shall mainly work with. 


b. Absolute Convergence. The Comparison Theorem and Its Consequences 


Definition 21 The series )°°° , dy is absolutely convergent if the series )77°_ | |an| 
converges. 


Since |an +---+@m| < |an|+--+|am|, the Cauchy convergence criterion implies 
that an absolutely convergent series converges. 

The converse of this statement is generally not true, that is, absolute convergence 
is a stronger requirement than mere convergence, as one can show by an example. 


Example 23 The series | — 1+ 5 5 + ; i +.---, whose partial sums are either 


i or 0, converges to 0. 
At the same time, the series of absolute values of its terms 


(ip eoeee s 
25S 8 


diverges, as follows from the Cauchy convergence criterion, just as in the case of 
the harmonic series: 


1 1 1 1 
oe + 
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To learn how to determine whether a series converges absolutely or not, it suffices 
to learn how to investigate the convergence of series with nonnegative terms. The 
following theorem holds. 


Theorem 7 (Criterion for convergence of series of nonnegative terms) A series aj + 
+--+ a, +--+ whose terms are nonnegative converges if and only if the sequence of 
partial sums is bounded above. 


Proof This follows from the definition of convergence of a series and the criterion 
for convergence of a nondecreasing sequence, which the sequence of partial sums 
is, in this case: 5] <so9<---<5Sy,<---. 


This criterion implies the following simple theorem, which is very useful in prac- 
tice. 


Theorem 8 (Comparison theorem) Let )°°° dn and ~~, by be two series with 
nonnegative terms. If there exists an index N €N such that ay < by for alln > N, 
then the convergence of the series )\-°_, bn implies the convergence of Y-°~., dn, 
and the divergence of )~~~_ dn implies the divergence of \--~_; bn. 


Proof Since a finite number of terms has no effect on the convergence of a se- 
ries, we can assume with no loss of generality that a, < b, for every index n € N. 
Then Ay = op) ak < Dp) De = Bn. If the series }°”~ | bn converges, then the se- 
quence {B,,}, which is nondecreasing, tends to a limit B. But then A, < B, < B for 
all n € N, and so the sequence A, of partial sums of the series par dy 18 bounded. 
By the criterion for convergence of a series with nonnegative terms (Theorem 7), 
the series )°°° | ay converges. 

The second assertion of the theorem follows from what has just been proved 
through proof by contradiction. 


saa 24 Since <4 < —1— for n > 2, we conclude that the series 


1 
ee (n—I1)n 
are =z and rls a = converge or diverge together. 
1 11 
But the latter series can be aa directly, by ai as that E&tD) =k AT? 
and therefore )77_ WEED =1-; ae Hence )°°- | eh sy = = Consequently the 
1 


series ) 4 a converges. It is interesting that }°P_, a= ad as will be proved 
below. 


Example 25 It should be observed that the comparison theorem applies only to se- 
ries with nonnegative terms. Indeed, if we set ay = —n and b, = 0, for example, we 
have dy, < by, and the series pet bn converges while past dn diverges. 


Corollary 8 (The Weierstrass M-test for absolute convergence) Let )~- | an and 
Yo by be series. Suppose there exists an index N € N such that |ay| < by for all 
n> N. Then a sufficient condition for absolute convergence of the series yo 14n 
is that the series }°r~_, by converge. 
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Proof In fact, by the comparison theorem the series °° , |a,| will then converge, 
and that is what is meant by the absolute convergence of )°-° | dn. 


This important sufficiency test for absolute convergence is often stated briefly as 
follows: If the terms of a series are majorized (in absolute value) by the terms of a 
convergent numerical series, then the original series converges absolutely. 


Example 26 The series )°°° ; sin converges absolutely, since AI < < =z and the 


series ete —z converges, as we saw in Example 24. 


Corollary 9 (Cauchy’s test) Let °°, dn be a given series anda = limn— oo */JanI- 
Then the following are true: 


a) ifa <1, the series yt an converges absolutely; 
b) ifa > 1, the series poet dn diverges; 
c) there exist both absolutely convergent and divergent series for which a = 1. 


Proof a) If a < 1, we can choose g € R such that a < q < 1. Fixing q, by definition 
of the superior limit, we find N € N such that %/Ja,| <q for all n > N. Thus we 
shall have |a,| <q" forn > N, and since the series }°°°_, g” converges for |g| < 1, 
it follows from the comparison theorem or from the Weierstrass criterion that the 
series }“- | dn converges absolutely. 

b) Since @ is a partial limit of the sequence {/|a,|} (Proposition 1), there exists 
a subsequence {d,,} such that limy—,o "Y/ lan, | =a. Hence if a > 1, there exists 
K €N such that |a,,| > 1 for all k > K, and so the necessary condition for conver- 
gence (a, — 0) does not hold for the series dina dn. It therefore a 


c) We already know that the series }~°° 1 diverges and a 4 converges 


n=1 n=1 


(absolutely, since I5|= = sr). At the same time, limy—soo 7/4 = limy—+oo Ti =1 


and iti ,-s56-44 4. = limy- 9 73 = lim, 00 WR = 


Example 27 Let us investigate the values of x € R for which the series 


[ee 


Ye+eny's" 


n=1 


converges. _ 
We compute a = limy—oo */|(2 + (H1I)”)"x"| = |x| limp—oo 12+ (—1)”"| = 3] x]. 
Thus for |x| < 5 the series converges and even absolutely, while for |x| > 3 the 


series diverges. The case |x| = : requires peated consideration. In the present 
case that is an elementary task, since for |x| = = + and n even (n = 2k), we have 


(2+ (- 1)2*)* ak — 3° (3K = |. Therefore ae series diverges, since it does not 
fulfill the necessary condition for convergence. 
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Corollary 10 (d’ Alembert’s® test) Suppose the limit limy_s ¢¢ | 
the series ¥-r~_, dn. Then, 


Gn+1 | _ ‘ 
a | =a exists for 


a) ifa <1, the series par ay converges absolutely; 
b) ifa > 1, the series at an diverges; 
c) there exist both absolutely convergent and divergent series for whicha = 1. 


Proof a) If a < 1, there exists a number g such that a < q < |. Fixing q and using 
properties of limits, we find an index N € N such that | Sat | <q forn>WN.Sincea 
finite number of terms has no effect on the convergence ofa series, we shall assume 
without loss of generality that | Cutt | <q foralln EN. 


Since 
an 


a2 Gn+1 


a) 


Qn+1 
an 


’ 


an-1 a\ 


we find that |an+1| < |ai|-q". But the series }°° , |ai|q" converges (its sum is 
| 


obviously c 1 7 ), so that the series }“P° | a, converges absolutely. 


b) If w > 1, then from some index N € N on we have | “ee | > 1, that is, |a,| < 
n 
|an+1|, and the condition a, — 0, which is necessary for convergence, does not hold 
. le) 
for the series \77" | an. 
c) As in the case of Cauchy’s test, the series °°, + and 7°, 4, provide ex- 
amples. 


Example 28 Let us determine the values of x € R for which the series 


[o,@) 
1 
aa 
n! 
n=1 
converges. 
For x = 0 it obviously converges absolutely. 
For x #0 we have limy—oo |*| = limtn—oo al =0. 
Thus, this series converges absolutely for every value of x € R. 


Finally, let us consider another special, but frequently encountered class of series, 
namely those whose terms form a monotonic sequence. For such series we have the 
following necessary and sufficient condition: 


Proposition 2 (Cauchy) [f a, > az >---> 0, the series pa ay converges if and 
only if the series aa OF aay =a, + 2a2 + 4a4 + 8ag + --- converges. 


Proof Since 


a2 a2 <4), 


6J.L. d’Alembert (1717-1783) — French scholar specializing in mechanics. He was a member of 
the group of philosophers who wrote the Encyclopédie. 
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244 < a3 +4 < 2a, 


4ag < a5 +aeo +a7z+ag <4a4, 


2" dont Sdgngy tees apni S 2” aon : 


by adding these inequalities, we find 
5 Sn —ai)< Apn+! — a, < Sn, 


where Ay = ay +--- + ay and S, = a, + 2a2 +--+ +2” an are the partial sums 
of the two series in question. The sequences {Ax} and {S,} are nondecreasing, and 
hence from these inequalities one can conclude that they are either both bounded 
above or both unbounded above. Then, by the criterion for convergence of series 
with nonnegative terms, it follows that the two series indeed converge or diverge 
together. 


This result implies a useful corollary. 


Corollary The series )-° converges for p > 1 and diverges for p < 1./ 


n=l a 
Proof If p = 0, the proposition implies that the series converges or diverges simul- 
taneously with the series 


[e,e) 


a= Ley 


k=0 k=0 


and a necessary and sufficient condition for the convergence of this series is that 
g =2!-P <1, thatis, p> 1. 

If p <0, the divergence of the series }~°° 
the series are not smaller than 1. 


n=1 3 | is obvious, since all the terms of 


The importance of this corollary is that the series )°°° ; = is often used as a 


comparison series to study the convergence of other series. 


tL 
nP 


c. The Number e as the Sum of a Series 


To conclude our study of series we return once again to the number e and obtain a 
series the provides a very convenient way of computing it. 


7Up to now in this book the number n? has been defined formally only for rational values of p, so 
that for the moment the reader is entitled to take this proposition as applying only to values of p 
for which n? is defined. 
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We shall use Newton’s binomial formula to expand the expression (1 + ty", 
Those who are unfamiliar with this formula from high school and have not solved 
part g) of Exercise 1 in Sect. 2.2 may omit the present appendix on the number e 
with no loss of continuity and return to it after studying Taylor’s formula, of which 
Newton’s binomial formula may be regarded as a special case. 

We know that e = limyo0(1 + 4)". 

By Newton’s binomial formula 


pe) ae pte Le ae 
n) I!n 2 2 


n(n—1)---(n—k+1) 1 1 
_m@ De @=ktD 
k} nk n” 


ee eee ee errr me | 
= 2 ; ki i y Cane 


Setting (1+ iy =e, andl +1+---y +--+ 4 = Sy, we thus have en < Sy 
(n=1,2,...). 

On the other hand, for any fixed k and n > k, as can be seen from the same 
expansion, we have 


t+] 1 1 1 1 1 1 1 k-1 
+ Ty 7s a saa = ane —— < en. 


As n — oo the left-hand side of this inequality tends to s; and the right-hand side 
to e. We can now conclude that s; <e for all k EN. 
But then from the relations 


Cn <Sn Se 


we find that limy— 4 5, =e. 
In accordance with the definition of the sum of a series, we can now write 


ree 1 
e= Poet ee 


This representation of the number e is very well adapted for computation. 
Let us estimate the difference e — s,: 
— @+I! @4+2L00 


O0<e-Sy 


1 1 1 
= col!+saot aenesst|< 


3.1 The Limit of a Sequence 103 


1 1 1 
1 ae 
<oopl+ostapt 
1 1 n+2 1 
= => < . 
1 2 
(n+D!1— ae ni(n + 1) nin 


Thus, in order to make the absolute error in the approximation of e by sy, less 
than, say 1073, it suffices that + < ae This condition is already satisfied by s6. 
Let us write out the first few decimal digits of e: 


e = 2.7182818284590.... 


This estimate of the difference e — s,, can be written as the equality 
On 
e=5, +—, where0 <6, <1. 
nin 


It follows immediately from this representation of e that it is irrational. Indeed, 
if we assume that e = ae where p,q € N, then the number q!e must be an integer, 
while 


0, qq: q! qq! @ 
le=q!(s,+—-)=q!+—+—4+--4+—-4-+4 
4 a(n a a ee) qq 
and then the number “ would have to be an integer, which is impossible. 
For the reader’s snronmatioh we note that e is not only irrational, but also tran- 
scendental. 


3.1.5 Problems and Exercises 


1. Show that a number x € R is rational if and only if its g-ary expression in any 
base q is periodic, that is, from some rank on it consists of periodically repeating 
digits. 

2. A ball that has fallen from height h bounces to height gh, where g is a con- 
stant coefficient 0 < g < 1. Find the time that elapses until it comes to rest and the 
distance it travels through the air during that time. 

3. We mark all the points on a circle obtained from a fixed point by rotations of the 
circle through angles of n radians, where n € Z ranges over all integers. Describe 
all the limit points of the set so constructed. 

4. The expression 


nyt 
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where n; EN, is called a finite continued fraction, and the expression 


1 
 ——— 
nz + —— 
n3+ 


is called an infinite continued fraction. The fractions obtained from a continued 
fraction by omitting all its elements from a certain one on are called the convergents. 
The value assigned to an infinite continued fraction is the limit of its convergents. 
Show that: 


a) Every rational number “, where m,n € N can be expanded in a unique man- 
ner as a continued fraction: 
m 1 
n QS 


1 
Qn-1t Gy, 


assuming that g, £1 forn > 1. 
Hint: The numbers qg, ..., ¢,, called the incomplete quotients or elements, can be 


obtained from the Euclidean algorithm 


m=n-qt+nri, 
n=1r|-qo+/r2, 


rp=12°q3 +73, 


by writing it in the form 


b) The convergents Rj = qi, Rx =qi t+ ae ... Satisfy the inequalities 
m 
Ry < R3 <-++ < Rog-1 < — < Rox < Rog-2 < +++ < Ro. 
n 


c) The numerators Py and denominators Q, of the convergents are formed ac- 
cording to the following rule: 


P= Pride + Pr-2, P2= 4192, Pi=4q1, 
Ox = Ox—-19k + Ok-2, Q2=42, Q,=1. 
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d) The difference of successive convergents can be computed from the formula 


(-D* 
Ry — Re-1 = ———._ ({k > I). 
Ox Qk-1 
e) Every infinite continued fraction has a determinate value. 
f) The value of an infinite continued fraction is irrational. 
g) 
1+V5 _ 1 


1 
2 +e 


h) The Fibonacci numbers 1, 1,2,3,5,8,... (that is, uw, = uUj—1 + Uy—2 and 
uy = U2 = 1), which are obtained as the denominators of the convergents in g), 
are given by the formula 


ol) CS) 


14.75 
2 


i) The convergents Ry = zt in g) are such that | 


this result with the assertions of Exercise 11 in Sect. 2.2. 


5. Show that 


Py 1 
0; | > ORS" Compare 


a) the equality 


Pea eee : : 
1! 2! ntonin | 1-2-2! (n—1)-n-n! 
holds for n > 2; 
1 : 
b) e=3 ee, (+l) n+2)(n+2)!? 
c) for computing the number e approximately the formula e ~ 1 + i + x + 
1 1 1 


at + 7m iS much better than the original formula e ~ 1 + ti + a teeta. 
(Estimate the errors, and compare the result with the value of e given on p. 103.) 


6. If a and b are positive numbers and p an arbitrary nonzero real number, then the 
mean of order p of the numbers a and b is the quantity 


1 
aP +bP\p 
S,(a, b) = F 
In particular for p = | we obtain the arithmetic mean of a and b, for p = 2 their 
square-mean, and for p = —1 their harmonic mean. 


a) Show that the mean S,,(a, b) of any order lies between the numbers a and b. 
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b) Find the limits of the sequences 
{Sn(a, b)}, {S_n(a, b)}. 


7. Show that if a > 0, the sequence x,41 = 5 (Xn + =) converges to the square root 
of a for any x; > 0. 

Estimate the rate of convergence, that is, the magnitude of the absolute error 
[Xn — /a| = |An| as a function of n. 
8. Show that 


a) 
So(n) = 19 +---+n%° =n, 

na+l) 1, 1 
==xn 


Si@sP +--+ =; an ton, 
n(n+1)(2n+1) 1 1 1 

S = 1? oye = — 3 2 ; 

2(n) +e-+n r 3” +50 + en 


wWint+l1)y 1 


1 1 
S$3(n) = Z qn + 57 + qn 


and in general that 

Sx(n) = agg in*t! +--+ ain +ap 
is a polynomial in n of degree k + 1. 

1 


5 Sk(n) _ 


3.2 The Limit of a Function 
3.2.1 Definitions and Examples 


Let E be a subset of R and a a limit point of E. Let f : E — R be a real-valued 
function defined on E. 

We wish to write out what it means to say that the value f(x) of the function 
f approaches some number A as the point x € E approaches a. It is natural to call 
such a number A the limit of the values of the function f, or the limit of f as x 
tends to a. 


Definition 1 We shall say (following Cauchy) that the function f : E > R tends to 
A as x tends to a, or that A is the limit of f as x tends to a, if for every ¢ > 0 there 
exists 6 > O such that | f(x) — A| < ¢ for every x € E such that 0 < |x —a| <6. 


In logical symbolism these conditions are written as 


Ve >035 >0Vx € E (0< |x —a| <5 >|f(x)—A| <e). 
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If A is the limit of f(x) as x tends to a in the set E, we write f(x) > A 
as x > a, x € E, or limy+a.xer f(x) = A. Instead of the expression x — a, 
x € E, we shall as a rule use the shorter notation E 5 x — a, and instead of 
limy+a,xeE f(x) we shall write limgs,;sa f(x). 


Example I] Let E = R\0, and f(x) =x sin i. We shall verify that 


: il 
lim xsin—=0. 
E>x->0 x 


Indeed, for a given ¢ > O we choose 6 = e. Then for 0 < |x| < 56 = ¢, taking 
account of the inequality |x sin 1| < |x|, we shall have |x sin i| <€. 


Incidentally, one can see from this example that a function f : E > R may have 
a limit as E 5 x — a without even being defined at the point a itself. This is exactly 
the situation that most often arises when limits must be computed; and, if you were 
paying attention, you may have noticed that this circumstance is taken into account 
in our definition of limit, where we wrote the strict inequality 0 < |x — al. 

We recall that a neighborhood of a point a € R is any open interval containing 
the point. 


Definition 2 A deleted neighborhood of a point is a neighborhood of the point from 
which the point itself has been removed. 


If U(a) denotes a neighborhood of a, we shall denote the corresponding deleted 
neighborhood by U (a). 
The sets 


Ug(a) := EN U(a) 
Ug(a) = ENU(a) 


will be called respectively a neighborhood of a in E and a deleted neighborhood of 
ain E. 

If a is a limit point of F, then U g(a) # © for every neighborhood U (a). 

If we temporarily adopt the cumbersome symbols U £(@) and Vz (A) to denote 
the deleted 5-neighborhood of a in E and the e-neighborhood of A in R, then 
Cauchy’s so-called “s—d-definition” of the limit of a function can be rewritten as 


( lim f@)=A) := WVE(A) aU, (a) (f (Op(a)) C Vg(A)). 


E>x—>a 
This expression says that A is the limit of the function f : E > R as x tends 
to a in the set E if for every e-neighborhood V,(A) of A there exists a deleted 


neighborhood U a (a) of a in E whose image f - (a)) under the mapping f : 
E => Ris entirely contained in Va (A). 
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Taking into account that every neighborhood of a point on the real line contains 
a symmetric neighborhood (a 6-neighborhood) of the same point, we arrive at the 
following expression for the definition of a limit, which we shall take as our main 
definition: 


Definition 3 


( lim f(x)=A) := WVp(A) 3U (a) (f(Uz(@)) C Ve (A)). 


E>x->a 


Thus the number A is called the limit of the function f : E > R as x tends to a 
while remaining in the set E (a must be a limit point of E) if for every neighborhood 
of A there is a deleted neighborhood of a in E whose image under the mapping 
f : E => Ris contained in the given neighborhood of A. 

We have given several statements of the definition of the limit of a function. For 
numerical functions, when a and A belong to R, as we have seen, these statements 
are equivalent. In this connection, we note that one or another of these statements 
may be more convenient in different situations. For example, the original form is 
convenient in numerical computations, since it shows the allowable magnitude of 
the deviation of x from a needed to ensure that the deviation of f(x) from A will 
not exceed a specified value. But from the point of view of extending the concept of 
a limit to more general functions the last statement the definition is most convenient. 
It shows that we can define the concept of a limit of a mapping f : X — Y provided 
we have been told what is meant by a neighborhood of a point in X and Y, that is, 
as we Say, a topology is given on X and Y. 

Let us consider a few more examples that are illustrative of the main definition. 


Example 2 The function 


1 if x > 0, 
sgnx = 4 0 if x =0, 
—-1 ifx<0O 


(read “signum x”®) is defined on the whole real line. We shall show that it has no 
limit as x tends to 0. The nonexistence of this limit is expressed by 


VA €RAV(A) VU(0) ax € U(O) (f (x) ¢ V(A)), 


that is, no matter what A we take (claiming to be the limit of sgnx as x — 0), there 
is aneighborhood V (A) of A such that no matter how small a deleted neighborhood 
U (0) of 0 we take, that deleted neighborhood contains at least one point x at which 
the value of the function does not lie in V(A). 


8The Latin word for sign. 
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Since sgnx assumes only the values —1, 0, and 1, it is clear that no number 
distinct from them can be the limit of the function. For any such number has a 
neighborhood that does not contain any of these three numbers. 

But if A € {—1,0, 1} we choose as V(A) the e-neighborhood of A with ¢ = 5: 
The points —1 and 1 certainly cannot both lie in this neighborhood. But, no matter 
what deleted neighborhood U (0) of 0 we may take, that neighborhood contains both 
positive and negative numbers, that is, points x where f(x) = 1 and points where 
f@)=-l. , 

Hence there is a point x € U(O) such that f(x) ¢ V(A). 

If the function f : E — R is defined on a whole deleted neighborhood of a point 
a ER, that is, when U g(a) = Up(a) -_ U(a), we shall agree to write more briefly 
x —> a instead of ESx— a. 


Example 3 Let us show that lim,_,9 | sgnx| = 1. 

Indeed, for x € R\O we have |sgnx| = 1, that is, the function is constant and 
equal to 1 in any deleted neighborhood U(0) of 0. Hence for any neighborhood 
V(1) we obtain f (U(0)) = 1 € V(1). 

Note carefully that although the function |sgnx| is defined at the point 0 itself 
and |sgn0| = 0, this value has no influence on the value of the limit in question. 
Thus one must not confuse the value f(a) of the function at the point a with the 
limit lim,—., f(x) that the function has as x > a. 

Let R_ and R be the sets of negative and positive numbers respectively. 


Example 4 We saw in Example 2 that the limit limps,9 sgnx does not exist. Re- 
marking, however, that the restriction sgn|p— of sgn to R_ is a constant function 
equal to —1 and sgn|p, is a constant function equal to 1, we can show, as in Exam- 
ple 3, that 
lim sgnx=-—1, and lim sgnx=1, 
R_5x—-0 R ax 0 

that is, the restrictions of the same function to different sets may have different limits 
at the same point, or even fail to have a limit, as shown in Example 2. 


Example 5 Developing the idea of Example 2, one can show similarly that sin 1 has 
no limit as x > 0. 

Indeed, in any deleted neighborhood U(0) of 0 there are always points of the 
form — Turia and — 72 a , Where n € N. At these points the function assumes 
the values —1 and | respectively. But these two numbers cannot both lie in the e- 
neighborhood V (A) of a point A € R if e < 1. Hence no number A € R can be the 
limit of this function as x — 0. 


Example 6 If 


p= |xeRix= nen 


—1/2+2nmn 
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and 
E,=}xeR|x=———_, neN}, 
- {x ao a 


then, as shown in Example 4, we find that 


; oe! ’ i ol 
lim sin—=—1 and lim sin-=1., 
E_3x->0 x E,>x—>0 x 
There is a close connection between the concept of the limit of a sequence stud- 
ied in the preceding section and the limit of an arbitrary numerical-valued function 
introduced in the present section, expressed by the following proposition. 


Proposition 1 ° The relation limps xa f(x) = A holds if and only if for every 
sequence {xy} of points x, € E\a converging to a, the sequence { f (xn)} converges 
to A. 


Proof The fact that (imgsxy+a f(x) = A) > (limy-+00 f (%) = A) follows imme- 
diately from the definitions. Indeed, if limg3;-+q f(x) = A, then for any neighbor- 
hood V(A) of A there exists a deleted neighborhood U E(a) of the point a in E 
such that for x € U g(a) we have f(x) € V(A). If the sequence {x,} of points in 
E\a converges to a, there exists an index N such that x, € U E(a) forn > N, and 
then f(x) € V(A). By definition of the limit of a sequence, we then conclude that 
limpsoo f (Xn) = A. 

We now prove the converse. If A is not the limit of f(x) as E 5 x — a, then 
there exists a neighborhood V (A) such that for any n €N, there is a point x, in the 
deleted ; neighborhood of a in E such that f(x,) € V(A). But this means that the 
sequence { f (x,,)} does not converge to A, even though {x,} converges to a. 


3.2.2 Properties of the Limit of a Function 


We now establish a number of properties of the limit of a function that are constantly 
being used. Many of them are analogous to the properties of the limit of a sequence 
that we have already established, and for that reason are essentially already known 
to us. Moreover, by Proposition | just proved, many properties of the limit of a 
function follow obviously and immediately from the corresponding properties of 
the limit of a sequence: the uniqueness of the limit, the arithmetic properties of the 
limit, and passage to the limit in inequalities. Nevertheless, we shall carry out all the 
proofs again. As will be seen, there is some value in doing so. 


°This proposition is sometimes called the statement of the equivalence of the Cauchy definition of 
a limit (in terms of neighborhoods) and the Heine definition (in terms of sequences). 
E. Heine (1821-1881) — German mathematician. 


3.2 The Limit of a Function 111 


We call the reader’s attention to the fact that, in order to establish the properties 
of the limit of a function, we need only two properties of deleted neighborhoods of 
a limit point of a set: 


Bi) Ug(a) # ©, that is, the deleted neighborhood of the point in E is nonempty; 
By) WU -(a) VU p(a) aU g(a) (U g(a) CU (a) NU, (a)), that is, the intersection 
of any pair of deleted neighborhoods contains a deleted neighborhood. This obser- 
vation leads us to a general concept of a limit of a function and the possibility of 
using the theory of limits in the future not only for functions defined on sets of 
numbers. To keep the discussion from becoming a mere repetition of what was said 
in Sect. 3.1, we shall employ some useful new devices and concepts that were not 
proved in that section. 


a. General Properties of the Limit of a Function 
We begin with some definitions. 


Definition 4 As before, a function f : E — R assuming only one value is called 
constant. A function f : E > R is called ultimately constant as E > x —> a if it is 
constant in some deleted neighborhood U ¢ (a), where a is a limit point of E. 


Definition 5 A function f : E — R is bounded, bounded above, or bounded below 
respectively if there is a number C € R such that | f(x)| < C, f(x) < C, or C < 
F(x) for all x € E. 

If one of these three relations holds only in some deleted neighborhood U g(a), 
the function is said to be ultimately bounded, ultimately bounded above, or ulti- 
mately bounded below as E > x — a respectively. 


Example 7 The function f(x) = sin + Lx COs + | defined by this formula for x 40 
is not bounded on its domain of definition, but it is ultimately bounded as x —> 0. 


Example 8 The same is true for the function f(x) =x on R. 


Theorem 1 a) (f : E — R is ultimately the constant A as E >x —> a)> 
(limgsxsa f(x) = A). 

b) Glimgs;¢ f(x) > (f : E> R is ultimately bounded as E 3 x > a). 

c) dimgsx+a f(x) = A1) A Cimgsya f(x) = Az) = (Al = Az). 


Proof The assertion a) that an ultimately constant function has a limit, and asser- 
tion b) that a function having a limit is ultimately bounded, follow immediately 
from the corresponding definitions. We now turn to the proof of the uniqueness of 
the limit. 
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Suppose A; ~ Az. Choose neighborhoods V(A;) and V(A2) having no points 
in common, that is, V(A1) MN V(A2) = ©. By definition of a limit, we have 


lim £0) = 41 > 30 e(@) (f (Ue @) CVAD). 


lim f(x) = Ar => UE @ (f(Ue@) C V(A2)). 
E>x—a 
We now take a deleted neighborhood U E(a) of a (which is a limit point of EF) 
such that U g(a) C Ue (a)N Ue (a). (For example, we could take Ug(a) = Oe (ayn 
U . (a), since this intersection is also a deleted neighborhood.) 
Since Ug (a) # @, we take x € Ug(a). We then have f(x) € V(A1) N V(A2), 


which is impossible since the neighborhoods V(A,) and V(A2) have no points in 
common. 


b. Passage to the Limit and Arithmetic Operations 


Definition 6 If two numerical-valued functions f : E > R and g: E > R havea 
common domain of definition F, their sum, product, and quotient are respectively 
the functions defined on the same set by the following formulas: 


(f + g)(x) = FQ) + 8Q@), 
(f + 8)@) = f(x): 8Q@), 


(L)e = LO) if g(x) 40 forx ce E. 
g(x 


Theorem 2 Let f: E > Rand g: E > R be two functions with a common domain 
of definition. 
Tf limgsx—+a f (x) = A and limgsx-+a g(x) = B, then 


a) limgsyoa(f +g)(x)=A+tTB; 
b) limgsxsalf - g)(x) =A-B; 
c) limgsx+a(Z) = 4 if BAO and g(x) £0 forx ec E. 


As already noted at the beginning of Sect. 3.2.2, this theorem is an immediate 
consequence of the corresponding theorem on limits of sequences, given Proposi- 
tion |. The theorem can also be obtained by repeating the proof of the theorem on 
the algebraic properties of the limit of a sequence. The changes needed in the proof 
in order to do this reduce to referring to some deleted neighborhood U E(a) of ain 
E, where previously we had referred to statements holding “from some N € N on”. 
We advise the reader to verify this. 

Here we shall obtain the theorem from its simplest special case when A = B = 0. 
Of course assertion c) will then be excluded from consideration. 

A function f : E — R is said to be infinitesimal as E 3x > a if 
limgsxsa f(x) =0. 
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Proposition 2 a) Ifa: E > Rand B: E > R are infinitesimal functions as E > 
x => a, then their sum a+ B: E > Ris also infinitesimal as E> x > a. 

b) Ifa: E > Rand B: E > Rare infinitesimal functions as E > x — a, then 
their product a - B : E > R is also infinitesimal as E > x > a. 

c) Ifa: E > R is infinitesimal as E 3 x > a and 8B: E > R is ultimately 
bounded as E > x — a, then the product a. B : E > R is infinitesimal as E > 
xa. 


Proof a) We shall verify that 


( lim a(x) =0) K (lim B(x) =0) = (lim (@ + B)(x) =0). 


E>x—>a axa 


Let ¢ > 0 be given. By definition of the limit, we have 


(lim a(x) =0) > (30-00 Vx €Up(a) (Jes < 5)). 


E2>x—->a 
(lim Be) =0) => (30200 Vx €U g(a) (6 < 5): 


Then for the deleted neighborhood U E(a) CU 3 (a)NU i“ (a) we obtain 
Vx €U g(a) |(@ + B)(x)| = lax) + B(2)| < |e(x)| + [B@)| <e. 


That is, we have verified that limgs,.+q(@ + B)(x) = 0. 

b) This assertion is a special case of assertion c), since every function that has a 
limit is ultimately bounded. 

c) We shall verify that 


( lim a(x) =0) A (AM €R3U g(a) Vx € Ue (a) (|B(x)| < M)) > 


E>x—>a 
= (_im,0980=0). 


Let ¢ > 0 be given. By definition of limit we have 


(lim a(x) =0) = (20,00 vx € U'p(a) (\a < =)). 


E>x—>a 


Then for the deleted neighborhood U - (a) Cc U ‘ (ayn U E(a), we obtain 


vx € Up (a) |(a- BY(x)| = |a@)B@)| = |a@)||B@)| <M =e. 


Thus we have verified that limgzs,+q a(x)B(x) = 0. 


The following remark is very useful: 
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Remark I 


CO (ala fea Lalani 2 aaa 


In other words, the function f : E — R tends to A if and only if it can be repre- 
sented as a sum A + a(x), where a(x) is infinitesimal as E > x — a. (The function 
a(x) is the deviation of f(x) from A.)!° 

This remark follows immediately from the definition of limit, by virtue of which 

li =A li — A) =0. 
Be eS oe Oe) 
We now give the proof of the theorem on the arithmetic properties of the limit 


of a function, based on this remark and the properties of infinitesimal functions that 
we have established. 


Proof a) If limgsx+q f(x) = A and limgs;+q g(x) = B, then f(x) =A+a(x) 
and g(x) = B + B(x), where a(x) and 6(x) are infinitesimal as E 5 x — a. Then 
(f +8) @) = f(x) +ex) = Ata(x) + B+B(x) = (A+B) +y (x), where y (x) = 
a(x) + B(x), being the sum of two infinitesimals, is infinitesimal as E > x > a. 
Thus limgsxysa(f +g)*%)=A+ B. 
b) Again representing f(x) and g(x) in the form f(x) = A+ a(x), g(x) = 
B+ B(x), we have 


(f -g)(x) = f (x)g(x) = (A+ a(x))(B+ B)) =A-B+y(a), 


where y(x) = AB(x) + Ba(x) + a(x)B(x) is infinitesimal as E 5 x — a because 
of the properties just proved for such functions. 

Thus, limgsysa(f -g)(x) =A-B. 

c) We once again write f(x) = A+ a(x) and g(x) = B+ B(x), where 
limgsx +g a(x) = 0 and limgsy-4, B(x) = 0. 

Since B # 0, there exists a deleted neighborhood U g(a), at all points of which 
B(x) < BE, and hence |g(x)| =|B + B(x)| > |B] — |B(x)| > BE. Then in Ue (a) 
we shall also have Tal < ae that is, the function aw is ultimately bounded as 
E>x— a. We then write 


(LZ) A f(x) A Ata(x) A _ 
g B g(x) B B+B(x) B- 


1 1 
= 2 : p (Bat) oF AB(x)) = y (x). 


'0Here is a curious detail. This very obvious representation, which is nevertheless very useful 
on the computational level, was specially noted by the French mathematician and specialist in 
mechanics Lazare Carnot (1753-1823), a revolutionary general and academician, the father of 
Sadi Carnot (1796-1832), who in turn was the creator of thermodynamics. 
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By the properties of infinitesimals (taking account of the ultimate boundedness of 


a) we find that the function y (x) is infinitesimal as E > x — a. Thus we have 


proved that limgsy+a(£)(x) = 4. 


c. Passage to the Limit and Inequalities 


Theorem 3 a) [f the functions f : E > R and g: E > R are such that 
limgsxsa f(x) = A, and limgsx+q g(x) = B and A < B, then there exists a 
deleted neighborhood U g(a) of a in E at each point of which f (x) < g(x). 

b) If the relations f(x) < g(x) < h(x) hold for the functions f : E> R, g: 
E => R, and h: E > R, and if limgsy—+a f (x) = limgsysgh(x) = C, then the 
limit of g(x) exists as E > x —> a, and limgs x+y g(x) =C. 


Proof a) Choose a number C such that A < C < B. By definition of limit, we find 
deleted neighborhoods U’,(a) and U';(a) of a in E such that | f(x) — Al <C — A 
for x € U',(a) and |g(x) — B| < B—C forxe U',(a). Then at any point of a 
deleted neighborhood U E(@) contained in U - (ayn U - (a), we find 


fe) <A+(C-A)=C=B-(B-C) <g(x). 


b) If limgsy;sa f(x) = limgsx—+aq h(x) = C, then for any fixed ¢ > 0 there exist 
Geleied neighborhoods U : (a) and U ' (a) of a in E such that C — ¢ < f(x) for 
x €U,(a) andh(x)<C+eforxe U,(a). Then at any point of a deleted neigh- 
borhood U g(a) contained in U',(a) ‘a U'-(a), we have C —«e < f(x) < g(x) < 
h(x) <C +6, that is, |g(x) — C| < e, and consequently limgsx+g g(x) =C. 


Corollary Suppose limgs,-+a f(x) = A and limgsx—+q g(x) = B. Let U g(a) bea 
deleted neighborhood of a in E. 


a) If f(x) > g(x) forall x € U g(a), then A> B; 
b) f(x) = g(x) forallx e€ U g(a), then A> B; 
c) f(x) > B forall x €Ug(a), then A> B; 

d) f(x) > B forall x €U g(a), then A> B. 


Proof Using proof by contradiction, we immediately obtain assertions a) and b) of 
the corollary from assertion a) of Theorem 3. Assertions c) and d) follow from a) 
and b) by taking g(x) = B. 


d. Two Important Examples 


Before developing the theory of the limit of a function further, we shall illustrate the 
use of the theorems just proved by two important examples. 
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Fig. 3.1 B = (cosz, sin 2) 


Example 9 


sin x 
lim —— = 1. 
x>0 X 


Here we shall appeal to the definition of sinx given in high school, that is, sin x is 
the ordinate of the point to which the point (1, 0) moves under a rotation of x radians 
about the origin. The completeness of such a definition is entirely a matter of the 
care with which the connection between rotations and real numbers is established. 
Since the system of real numbers itself was not described in sufficient detail in high 
school, one may consider that we need to sharpen the definition of sinx (and the 
same is true of cos x). 

We shall do so at the appropriate time and justify the reasoning that for now will 
rely on intuition. 

a) We shall show that 


2 sin x a 
cos’ x < —— <1 FORO sles 
x 


Proof Since cos* x and sin are even functions, it suffices to consider the case 0 < 


x < 7/2. By Fig. 3.1 and the definition of cos x and sinx, comparing the area of the 
sector <OCD, the triangle AO AB, and the sector {OAB, we have 


1 ~ 1 1 
SazocD = Oe -|CD|= 5 rosa ecos) = 5 ¥ COS" x < 


1 
< SAOAB = ZlOAl -|BC|= 


1 
-l-sinx = 5 sinx < 


1 
< Soap = 5|0A|-|AB| = 


Dividing these inequalities by 5x, we find that the result is what was asserted. 


b) It follows from a) that 
| sinx| < |x| 


for any x € R, equality holding only at x = 0. 
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Proof For 0 < |x| < 2/2, as shown in a), we have 
| sinx| < |x|. 


But | sinx| < 1, so that this last inequality also holds for |x| > 2/2 > 1. Only for 
x = 0 do we find sinx =x =0. 


c) It follows from b) that 


lim sinx = 0. 
x>0 


Proof Since 0 < |sinx| < |x| and lim,-—,9 |x| = 0, we find by the theorem on the 
limit of a function and inequalities (Theorem 3) that lim,—.9|sinx| = 0, so that 
lim,_.9 sinx = 0. 


sinx __ 1 


d) We shall now prove that lim,_.9 == = 


Proof Assuming that |x| < 2/2, from the inequality in a) we have 


5) sin x 
x <— <1. 
x 


1 — sin 
But lim,;.9(1 — sin? x) = 1 — lim, osinx - lim,_,9sinx = 1 — 0 = 1, so that 
by the theorem on passage to the limit and inequalities, we conclude that 


limy-+9 S8* = 1. 


Example 10 (Definition of the exponential, logarithmic, and power functions using 
limits) We shall now illustrate how the high-school definition of the exponential 
and logarithmic functions can be completed by means of the theory of real numbers 
and limits. 


For convenience in reference and to give a complete picture, we shall start from 
the beginning. 


a) The exponential function. Let a > 1. 


1° For n € N we define inductively a! :=a, a"t!:=a" -a. 
In this way we obtain a function a” defined on N, which, as can be seen from the 
definition, has the property 


q” 


qe 
ifm,neNandm>n. 
2° This property leads to the natural definitions 


1 
a :=1, a= — forneN, 
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which, when carried out, extend the function a” to the set Z of all integers, and then 


a” 7 a” ae qntn 


for any m,n €Z. 

3° In the theory of real numbers we have observed that for a > 0 andn € N there 
exists a unique nth root of a, that is, a number x > 0 such that x” = a. For that 
number we use the notation a!/”. It is convenient, since it allows us to retain the law 
of addition for exponents: 


a= a = (al/”)” BS qi/n Ai eg iln = qiint +n 
For the same reason it is natural to set a”"/" := (a!/")" and a7!/" := (a'/")7! 
forn € Nand m € Z. If it turns out that a@@")/@ = q™/" for k € Z, we can consider 


that we have defined a’ forr € Q. 
4° For numbers 0 < x, 0 < y, we verify by induction that for n €¢ N 


(x < y) @ (x” <y"), 
so that, in particular, 
Gane Sy). 


5° This makes it possible to prove the rules for operating with rational exponents, 
in particular, that 


amb) /k) — gmln torkeZ 


and 


qimi/m , qm2/n2 — gmi/nitma/ng | 


Proof Indeed, al™)/@) 5 0 and a”/" > 0. Further, since 
(a/b) = ((ali@oymhyn x 


1/(nk)\mk-nk 1/(nk)\nkymk mk 
(am ye = (aren =a 


and 
(a7ny = (any) = ak, 


it follows that the first of the inequalities that needed to be verified in connection 
with point 4° is now established. 
Similarly, since 


(arm : gerne? <4 (ane ; (qr? = 


a3 Cg i : (gieyyen — qitin2 , qm2ni 


myn man 
= qgitingtmgn, 
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and 
(area me yore = gaa) i = 


= (Gas! One) EG nie ae = qgminatmany | 


the second equality is also proved. 


Thus we have defined a’ for r € Q and a’ > 0; and for any r1, r2 € Q, 


=q't, 


a a” 


6° It follows from 4° that for r},r2 € Q 
(r}<n)> (a! < ay; 


Proof Since (1 <a) (1 <a!/") for n € N, which follows immediately from 4°, 
we have (a!/")" = a’"/" > 1 forn,m €N, as again follows from 4°. Thus for 1 <a 
and r > 0, r € Q, we have a’ > 1. 

Then for 71 < rz we obtain by 5° 


a? =al-a?"sa"-Ll=a". 


7° We shall show that for ro € Q 


lim a’ =a’. 
Qsr—>ro 


Proof We shall verify that a? > 1 as Q5 p — 0. This follows from the fact that 
for |p| < , we have by 6° 


aa!" <@P <q'/", 


We know that a!/” — 1 (and a~!/" -> 1) as n > oo. Then by standard reasoning 
we verify that for e > 0 there exists 5 > 0 such that for | p| < 6 we have 


l—e<a’<l+e. 


We can take ‘ as 5 here if 1—e <a7!/"anda!/" <1+.e. 
We now prove the main assertion. 
Given € > 0, we choose 6 so that 


l-—ea" <a? <1+ea” 
for |p| < 6. If now |r — ro| < 6, we have 
a(1 —ea~") <a’ =a"-a" sql +ea~"), 


which says 


a —e<a’ <a-+<e. 
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Thus we have defined a function a” on Q having the following properties: 


a =a>l; 


a'.q? = qt. 
<a™  forr, <1ro; 
a's>a™ asQarn—-n. 


We now extend this function to the entire real line as follows. 

8° Letx €R,s= SUPgs; <x 4’, and i = infgs,>, a". Itis clear that s, i € R, since 
for r) <x <r2 wehavea" <a”. 

We shall show that actually s =i (and then we shall denote this common value 
by a*). 


Proof By definition of s andi we have 
a'<s<i<a” 


forr) <x <r2. ThenO <i—s <a™’—-—a" =a" (a2 — 1) <s(a’2™"! — 1). But 
a? — 1asQ=5 p— 0,s0 that for any ¢ > 0 there exists 6 > 0 such that a’2~"! — 1 < 
é/s for0 <rzg—r, <6. We then find that 0 <i —s < ¢, and since ¢ > 0 is arbitrary, 
we conclude that i = s. 


We now define a* :=s =i. 
9° Let us show that a* = lim@sr—x a’. 


. . / 
Proof Taking 8° into account, for ¢ > 0 we find r’ < x such that s —e <a" <s= 
‘ ” F f be “ £ ” 
a* andr” such thata* =i<a"™ <i+e.Sincer’ <r <r” impliesa” <a’ <a’ , 
we then have, for all r € Q in the open interval Jr’, r’’[, 


a—e<a <a*+e. 


We now study the properties of the function a* so defined on R. 
10° For x1,.x2 € Randa > 1, (x1 < x2) > (a! <a”). 


Proof On the open interval |x, x2[ there exist two rational numbers r; < rz. If 
xX) <r, <1r2 < x2, by the definition of a* given in 8° and the properties of the 
function a* on Q, we have 


q! <q" <qr <a™®, 
11° For any x1, x2 € R, a®! -a® =a™! 4%, 


Proof By the estimates that we know for the absolute error in the product and by 
property 9°, we can assert that for any ¢ > 0 there exists 6’ > 0 such that 
Sf xg vg? eegtsah he 


2 2 
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for |x, —1r,| < 6’, and |x2 —r2| < 5’. Making 6’ smaller if necessary, we can choose 
5 < 6’ such that we also have 


E é 
qtr. at < quite <qitn oe 


Zz 


for |xj —7r,| <6 and |x2 — r2| < 4, that is, |(x; + x2) — (71 +12)| < 26. 
But a”! - a’ =a"'*", for r1, ro € Q, so that these inequalities imply 


x] 


a .d@—e<dat™ <a". aq? +e. 


Since ¢ > 0 is arbitrary, we conclude that 


x] 


x2 quit 


a’:a 


12° lim,_, xp @ =a. (We recall that “x — xo” is an abbreviation of “IR 5 x > 
xo”.) 


Proof We first verify that lim,—.9 a* = 1. Given ¢ > 0, we find n € N such that 
lage ge gl Sra 
Then by 10°, for |x| < 1/n we have 
topeag ag 2a a1. 


that is, we have verified that limy_.9a* = 1. 
If we now take 6 > 0 so that |a*~*° — 1| < ea~™ for |x — xo| < 6, we find 


a —¢e <a* =a (a - 1) <a % +e, 


which verifies that lim, ,,a* =a”. 


13° We shall show that the range of values of the function x +> a* is the set Ry 
of positive real numbers. 


Proof Let yo € R,. If a > 1, then as we know, there exists n € N such that a~” < 
yo <a". 
By virtue of this fact, the two sets 


A={xeR|a* < yo} and B={xeR|yo<a’‘} 


are both nonempty. But since (x, < x2) = (a*! < a*?) (when a > 1), for any num- 
bers x1,x2 € R such that x; € A and x2 € B we have x; < x2. Consequently, the 
axiom of completeness is applicable to the sets A and B, and it follows that there 


exists x9 such that x; < x9 < x2 for all x; € A and x2 € B. We shall show that 
a“ = yo. 


122 3 Limits 


If a*° were less than yo, then, since qrotl/n _, gX0 as n > 00, there would be a 
number n € N such that a*°+!/” < yo. Then we would have (xo + 1) € A, while the 
point xo separates A and B. Hence the assumption a*° < yo is untenable. Similarly 
we can verify that the inequality a*° > yo is also impossible. By the properties of 
real numbers, we conclude from this that a*° = yo. 


14° We have assumed up to now that a > 1. But all the constructions could be 
repeated for 0 < a < 1. Under this condition 0 < a” < 1 if r > 0, so that in 6° and 
10° we now find that (x1 < x2) => (a*! > a*2) where 0 <a <1. 

Thus for a > 0, a 4 1, we have constructed a real-valued function x + a* on the 
set R of real numbers with the following properties: 


1) al=a; 

2) a -a@= qvit*, 

3) a® > a“ asx > x0; 

4) (a! <a?) S (x, < x2) ifa> 1, and (a! > a?) S (xy <x) if0<a<1; 

5) the range of values of the mapping x > a* is R; = {y €e R| 0 < y}, the set 
of positive numbers. 


Definition 7 The mapping x +> a* is called the exponential function with base a. 
The mapping x + e*, which is the case a = e, is encountered particularly often 
and is frequently denoted exp x. In this connection, to denote the mapping x b> a’, 
we sometimes also use the notation exp, x. 
b) The logarithmic function. The properties of the exponential function show that 
it is a bijective mapping exp, : R — R_. Hence it has an inverse. 


Definition 8 The mapping inverse to exp, : R > R+ is called the logarithm to base 
a (0 <a,a #1), and is denoted 


log, :R4+— R. 


Definition 9 For base a = e, the logarithm is called the natural logarithm and is 
denoted In: Ry > R. 

The reason for the terminology becomes clear under a different approach to log- 
arithms, one that is in many ways more natural and transparent, which we shall 
explain after constructing the fundamentals of differential and integral calculus. 

By definition of the logarithm as the function inverse to the exponential function, 
we have 


Vx €R (log, (a*) =x), 
VyeR, (a'°& =) 


It follows from this definition and the properties of the exponential function in 
particular that in its domain of definition R; the logarithm has the following prop- 
erties: 
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1’) log,a=1; 

2’) logg (y1 - y2) = log, yi + log, ya; 

3’) log, y > log, yo as R43 y—> yoE Rg; 

4’) (log, y1 < log, y2) > (1 < y2) ifa > 1 and (log, yi > log, y2) > (1 < y2) 
if0<a< 1; 

5’) the range of values of the function log, : R+ — R is the set R of all real num- 
bers. 


Proof We obtain 1’) from property 1) of the exponential function and the definition 
of the logarithm. 

We obtain property 2’) from property 2) of the exponential function. Indeed, let 
x1 = log, y1 and x2 = log, y2. Then yj = a”! and y2 = a”, and so by 2), yi - y2 = 
a*! . q*2 = a*™!**2, from which it follows that log, (71 - y2) = x1 + x2. 

Similarly, property 4) of the exponential function implies property 4’) of the log- 
arithm. 


It is obvious that 5) > 5’). 
Property 3’) remains to be proved. 
By property 2’) of the logarithm we have 


y 
log, a log, yo = log, (=) ’ 
YO 
and therefore the inequalities 
—e <log, y—log, w <& 


are equivalent to the relation 


log, (a~°) =—e < tz, (=) <e=log,(a’), 
YO 


which by property 4’) of the logarithm is equivalent to 
-a< oe <a fora>1, 
YO 


E 


G2 2a for0O<a<l. 


yO 


In any case we find that if 
yoa © <y<yoa® whena> 1 


or 


© when 0 <a <1, 


yoa® <y < yoa” 
we have 


—e <log, y — log, yo <€. 
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Fig. 3.2 


Thus we have proved that 


lim lo =lo : 
Rysy—> eR, Sa Y Sa YO 


Figure 3.2 shows the graphs of the functions e*, 10*, Inx, and log; x =: log x; 
Fig. 3.3 gives the graphs of (2), 0.1", logy ;_.x, and logo | x. 

We now give a more detailed discussion of one property of the logarithm that we 
shall have frequent occasion to use. 

We shall show that the equality 
6’) 


log, (b) = a log, b 


holds for any b > 0 and anya € R. 


Proof 1° The equality is true for a =n € N. For by property 2’) of the logarithm 
and induction we find log, (v1 --- yn) = log, yi +--+ log, yn, so that 


log, (b") = log, b+ --- + log, b =n log, b. 
2° log, (b~!) = — log, b, for if B = log, b, then 
b=a’, bb '=a® and log,(b7') =—8£. 
3° From 1° and 2° we now conclude that the equality log, (b%) = a log, b holds 
fora € Z. 


4° log, (b!/") = 4 log, b for n € Z. Indeed, 


log, b =log, (bl/")” =nlog, (b'/"). 
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Fig. 3.3 : y 
(1/e)* 


logo 1 x 


loge X 


5° We can now verify that the assertion holds for any rational number a = veQ. 
In fact, 


m 
Jog, b = mlog,(b"/") = log, (b4")" = log, (0!) 


6° But if the equality log, b” = rlog, b holds for all r € Q, then letting r in 
Q tend to a, we find by property 3) for the exponential function and 3’) for the 
logarithm that if r is sufficiently close to a, then b” is close to b® and log, b” is 
close to log, b%. This means that 
lim log, b’ =log, b®. 


Qer-a 


But log, b” =r log, b, and therefore 


log, b% = genes log, b' = ee rlog, b=alog, b. 


From the property of the logarithm just proved, one can conclude that the follow- 
ing equality holds for any a, 8 € R anda > 0: 
6) (a®%)P =a%. 


Proof For a = 1 we have 1% = | by definition for all a € R. Thus the equality is 
trivial in this case. 
If a ~ 1, then by what has just been proved we have 


log, ((a*)") = Blog, (a*) = B - a log, a= B -a =log,(a"*), 


which by property 4’) of the logarithm is equivalent to this equality. 


c) The power function. If we take 1% = 1, then for all x > 0 and a € R we have 
defined the quantity x (read “‘x to power a”). 
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Fig. 3.4 


Definition 10 The function x +> x® defined on the set R+ of positive numbers is 
called a power function, and the number a is called its exponent. 


A power function is obviously the composition of an exponential function and 
the logarithm; more precisely 


a " 
xt = 8a Dee. qt Sar. 


Figure 3.4 shows the graphs of the function y = x® for different values of the 
exponent. 


3.2.3. The General Definition of the Limit of a Function (Limit 
over a Base) 


When proving the properties of the limit of a function, we verified that the only 
requirements imposed on the deleted neighborhoods in which our functions were 
defined and which arose in the course of the proofs were the properties B;) and 
Bz), mentioned in the introduction to the previous subsection. This fact justifies the 
definition of the following mathematical object. 


a. Bases; Definition and Elementary Properties 
Definition 11 A set 6 of subsets B C X of a set X is called a base in X if the 
following conditions hold: 


B,) VBE B(BF QB); 
Bo) VB} €BVB2.E€ BABE B(BCB,NB)). 


In other words, the elements of the collection 6 are nonempty subsets of X and 
the intersection of any two of them always contains an element of the same collec- 
tion. 

In Table 3.1 we list some of the more useful bases in analysis. 
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Table 3.1 

Notation for Read Sets (elements) of the Definition of and notation 

the base base for elements 

xa x tends to a Deleted neighborhoods U (a) := {x E€R|a-—6, < 
ofaeR <x<atéd.Ax a}, 

where 6; > 0, d2 > 0 

x — oo x tends to infinity Neighborhoods of U(oo) := {x ER] 6 < |x]}, 
infinity where 5€ R 

x—>a,xEE x tends toa in E Deleted neighborhoods" U E(@):=EN U (a) 

or ofainE 

E2>x->a 

or 

X—>cra 

x>00,x EE x tends to Neighborhoods” of Ug() := ENU(cx) 

or infinity in E infinity in E 

E2x>o 


or 
X —>ef 00 


“Tt is assumed that a is a limit point of E 


““Tt is assumed that E is not bounded 


If E= Et = {x €R|x >a} Gesp. E = EZ = {x € R| x < a}) we write 
x > a+0 (resp. x > a — 0) instead of x > a, x € E, and we say that x tends 
to a from the right (resp. x tends to a from the left) or through larger values 
(resp. through smaller values). When a = 0 it is customary to write x — +0 (resp. 
x — —0) instead of x > 0+ 0 (resp. x — 0-0). 

The notation E 5 x > a+0 (resp. E 5 x — a — 0) will be used instead of 
x—> a,x € ENE} (resp. x > a, x € EM EZ). It means that x tends to a in E 
while remaining larger (resp. smaller) than a. 

If 


E=E,={xeR|e<x} (tsp. =F, =x eR |x <x), 


we write x —> +00 (resp. x > —0o) instead of x — oo, x € E and say that x tends 
to positive infinity (resp. x tends to negative infinity). 

The notation E 3 x > +c (resp. E > x — —oo) will be used instead of 
x—> 00,x € EN EX (resp. x > 00, x € ENE). 

When E =N, we shall write (when no confusion can arise), as is customary in 
the theory of limits of sequences, n — oo instead of x > co, x EN. 

We remark that all the bases just listed have the property that the intersection of 
two elements of the base is itself an element of the base, not merely a set containing 
an element of the base. We shall meet with other bases in the study of functions 
defined on sets different from the real line.!! 


‘lor example, the set of open disks (not containing their boundary circles) containing a given 
point of the plane is a base. The intersection of two elements of the base is not always a disk, but 
always contains a disk from the collection. 
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We note also that the term “base” used here is an abbreviation for what is called 
a “filter base’, and the limit over a base that we introduce below is, as far as analysis 
is concerned, the most important part of the concept of a limit over a filter,!* created 
by the modern French mathematician H. Cartan. 


b. The Limit of a Function over a Base 
Definition 12 Let f : X — R be a function defined on a set X and BG a base in X. 
A number A € R is called the limit of the function f over the base B if for every 
neighborhood V (A) of A there is an element B € 6 whose image f (B) is contained 
in V(A). 

If A is the limit of f : X — R over the base LB, we write 


lim f(x) =A. 


We now repeat the definition of the limit over a base in logical symbols: 


(lim fo) = A) := WV (A) 4B € B (f(B) C V(A)). 


Since we are considering numerical-valued functions at the moment, it is useful 
to keep in mind the following form of this fundamental definition: 


(lim f(x) =A) := Ve > 04B € BYx € B (| f(x) — A] <e). 


In this form we take an ¢-neighborhood (symmetric with respect to A) instead 
of an arbitrary neighborhood V(A). The equivalence of these definitions for real- 
valued functions follows from the fact mentioned earlier that every neighborhood of 
a point contains a symmetric neighborhood of the same point (carry out the proof in 
full!). 

We have now given the general definition of the limit of a function over a base. 
Earlier we considered examples of the bases most often used in analysis. In a specific 
problem in which one or another of these bases arises, one must know how to decode 
the general definition and write it in the form specific to that base. 

Thus, 


( lim | f(@) =A) := Ve > 058 > 0 ¥x € Ja — 8, al (| f(x) — A] <e), 


x a— 


(lim fo) =A) := We > 048 € RVx <5 (| f(x) — A] <e). 


!2Ror more details, see Bourbaki’s, General topology, Addison-Wesley, 1966. 


3.2 The Limit of a Function 129 

In our study of examples of bases we have in particular introduced the concept 
of a neighborhood of infinity. If we use that concept, then it makes sense to adopt 
the following conventions in accordance with the general definition of limit: 


(lim f= 00) := WV (co) 3B € B (f(B) C V(0o)), 


or, what is the same, 


\: 


(lim f@) = +00) :=Ve € RABE BYx eB (e < f(x), 


(lim fx) = 00) :=Ve > 0 4B EBYx eB (e <|f(x) 


(lim fo) = —oo) :=We €RABEBYx € B (f(x) <e). 


The letter ¢ is usually assumed to represent a small number. Such is not the case 
in the definitions just given, of course. In accordance with the usual conventions, for 
example, we could write 


( lim | f(x) = 00) :=Ve ERASER Vx > 8 (f(x) <2). 


We advise the reader to write out independently the full definition of limit for 
different bases in the cases of both finite (numerical) and infinite limits. 

In order to regard the theorems on limits that we proved for the special base 
E> x — ain Sect. 3.2.2 as having been proved in the general case of a limit over an 
arbitrary base, we need to make suitable definitions of what it means for a function 
to be ultimately constant, ultimately bounded, and infinitesimal over a given base. 


Definition 13 A function f : X — R is ultimately constant over the base B if there 
exists a number A € R and an element B € B such that f(x) = A for all x € B. 


Definition 14 A function f : X — R is ultimately bounded over the base B if there 
exists a number c > 0 and an element B € B such that | f(x)| <c for all x € B. 


Definition 15 A function f : X — R is infinitesimal over the base B if limg f (x) 
=0. 


After these definitions and the fundamental remark that all proofs of the theorems 
on limits used only the properties B;) and Bz), we may regard all the properties of 
limits established in Sect. 3.2.2 as valid for limits over any base. 

In particular, we can now speak of the limit of a function as x — oo or as x > 
—oo or as X > +00. 

In addition, we have now assured ourselves that we can also apply the theory of 
limits in the case when the functions are defined on sets that are not necessarily sets 
of numbers; this will turn out to be especially valuable later on. For example, the 
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length of a curve is a numerical-valued function defined on a class of curves. If we 
know this function on broken lines, we can define it for more complicated curves, 
for example, for a circle, by passing to the limit. 

At present the main use we have for this observation and the concept of a base 
introduced in connection with it is that they free us from the verifications and formal 
proofs of theorems on limits for each specific type of limiting passage, or, in our 
current terminology, for each specific type of base. 

In order to master completely the concept of a limit over an arbitrary base, we 
shall carry out the proofs of the following properties of the limit of a function in 
general form. 


3.2.4 Existence of the Limit of a Function 


a. The Cauchy Criterion 
Before stating the Cauchy criterion, we give the following useful definition. 


Definition 16 The oscillation of a function f : X > Rona set E C X is 


o(f, E):= sup | f (x1) — f (x2) 


X1,X2E€E 


’ 


that is, the least upper bound of the absolute value of the difference of the values of 
the function at two arbitrary points x1, x2 € E. 


Example 11 w(x?,{—1,2]) =4; 
Example 12 w(x,[—1,2]) =3, 
Example 13 w(x, |]—1,2D =3; 
Example 14 w(sgnx,[—1, 2]) = 2; 
Example 15 w(sgnx, [0, 2]) = 1; 
Example 16 w(sgnx, ]0,2]) =0. 
Theorem 4 (The Cauchy criterion for the existence of a limit of a function) Let X 
be a set and B a base in X. 
A function f : X — R has a limit over the base B if and only if for every ¢ > 0 


there exits B € B such that the oscillation of f on B is less than e. 


Thus, 
alia fs) &Ve>0dBeB (o(f, B)< é). 
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Proof Necessity. If limg f(x) = A € R, then, for all ¢ > 0, there exists an element 
B € B such that | f(x) — A| < €/3 for all x € B. But then, for any x1,x2 € B we 
have 


2 
| fa) — f@2)| <| fer) — A] +| fa) - Al < = 


and therefore w(f; B) <e&. 


Sufficiency. We now prove the main part of the criterion, which asserts that if for 
every € > 0 there exists B € B for which w(f, B) < e, then the function has a limit 
over B. 

Taking € successively equal to 1,1/2,...,1/n,..., we construct a sequence 
B,, Bo,..., By... of elements of B such that w(f, By) < 1/n,n €N. Since B, ~ 
@, we can choose a point x, in each B,. The sequence f (x1), f(%2),..., f(Xn),--- 
is a Cauchy sequence. Indeed, B, 1 By, # @, and, taking an auxiliary point x € 
By By, we find that |f (xn) — fQm)| < If Gn) — FOI + 1f@) — fm) < 
1/n + 1/m. By the Cauchy criterion for convergence of a sequence, the sequence 
{f(xn),n € N} has a limit A. It follows from the inequality established above, 
if we let m — o, that | f(x,) — A| < 1/n. We now conclude, taking account 
of the inequality w(f; Bn) < 1/n, that | f(x) — A| < e at every point x € B, if 
n>N=[2/e] +1. 


Remark This proof, as we shall see below, remains valid for functions with values in 
any so-called complete space Y. If Y = R, which is the case we are most interested 
in just now, we can if we wish use the same idea as in the proof of the sufficiency of 
the Cauchy criterion for sequences. 


Proof Setting mg = inf,.g f(x) and Mg = sup,.-, f(x), and remarking that 
mB, < MBB, < Mp,nB, < Mp, for any elements B, and By» of the base 6, we 
find by the axiom of completeness that there exists a number A € R separating 
the numerical sets {mg} and {Mg}, where B € B. Since w(f; B) = Mg — mpg, we 
can now conclude that, since w(f; B) < €, we have | f(x) — A| < € at every point 


xe. 


Example 17 We shall show that when X = N and B is the base n > co, n EN, 
the general Cauchy criterion just proved for the existence of the limit of a function 
coincides with the Cauchy criterion already studied for the existence of a limit of a 
sequence. 

Indeed, an element of the base n > co,n EN, isaset B=NNU(cC)={neEN | 
N <n} consisting of the natural numbers n € N larger than some number N é€ R. 
Without loss of generality we may assume N €N. The relation w(f; B) < ¢ now 
means that | f(n1) — f(12)| <¢ foralln,,n2>N. 

Thus, for a function f : N > R, the condition that for any ¢ > 0 there exists 
B € B such that w(f; B) < € is equivalent to the condition that the sequence { f (”)} 
be a Cauchy sequence. 
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b. The Limit of a Composite Function 


Theorem 5 (The limit of a composite function) Let Y be a set, By a base in Y, and 
g:Y — Ra mapping having a limit over the base By. Let X be a set, Bx a base 
in X and f : X — Y a mapping of X into Y such that for every element By € By 
there exists Bx € Bx whose image f (Bx) is contained in By. 

Under these hypotheses, the composition g o f : X — R of the mappings f and 
g is defined and has a limit over the base By and limp, (g 0 f)(x) = limgy g(y). 


Proof The composite function go f : X — R is defined, since f(X) C Y. Suppose 
limp, g(y) = A. We shall show that limg, (g o f)(x) = A. Given a neighborhood 
V(A) of A, we find By € By such that g(By) C V(A). By hypothesis, there exists 
By € By such that f(Bxy) C By. But then (g o f)(Bx) = g(f(Bx)) C g(By) C 
V (A). We have thus verified that A is the limit of the function (go f) : X > R over 
the base By. 


Example 18 Let us find the following limit: 


If we set g(y) = ome and f(x) = 7x, then (go f)(x) = “42. In this case Y = 


siny __ 


R\0O and X = R. Since limy_,0 g(y) = limy..0 >> = 1, we can apply the theorem 
if we verify that for any element of the base y — 0 there is an element of the base 
x —> 0 whose image under the mapping f(x) = 7x is contained in the given element 
of the base y > 0. 

The elements of the base y — 0 are the deleted neighborhoods U y (0) of the 
pointO eR. 

The elements of the base x — 0 are also deleted neighborhoods U x (0) of the 
point 0 € R. Let Uy(0)={yEeR|a<y<f,y40} (where a, 6 € R and a < 0, 
B > 0) be an arbitrary deleted neighborhood of 0 in Y. If we take U x(0)= {xe 
R| 7 <x< x O}, this deleted neighborhood of 0 in X has the property that 
f(Ux(0)) = Uy) C Uy). 

The hypotheses of the theorem are therefore satisfied, and we can now assert that 

_ sin 7x . siny 
lim = lim = 


x0 x y>O y 


1. 


Example 19 The function g(y) = | sgn y|, as we have seen (see Example 3), has the 
limit lim,_.9|sgny| = 1. 

The function y = f(x) =x sin i, which is defined for x 4 0, also has the limit 
lim,_,9 x sin + = 0 (see Example 1). 

However, the function (g o f)(x) = |sgn(x sin +) has no limit as x > 0. 

Indeed, in any deleted neighborhood of x = 0 there are zeros of the function 
sin x, so that the function | sgn(x sin +I assumes both the value 1 and the value 0 in 
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any such neighborhood. By the Cauchy criterion, this function cannot have a limit 
asx —> 0. 

But does this example not contradict Theorem 5? 

Check, as we did in the preceding example, to see whether the hypotheses of the 
theorem are satisfied. 


Example 20 Let us show that 


| x 
lim (1 + =) =e. 
X00 x 


Proof Let us make the following assumptions: 


Y=N, By is the basen > cw, neEN; 


X=R,={xeER|x>0}, By is the base x > +00; 
f:X—Y_ isthe mapping xt>[x], 


where [x] is the integer part of x (that is, the largest integer not larger than x). 

Then for any By = {n € N|n > N} in the base n > ov, n € N there obviously 
exists an element By = {x € R| x > N + 1} of the base x + +00 whose image 
under the mapping x b> [x] is contained in By. 

The functions g(n) = (1+ 4)", gi(n) = (1+ =4;)", and go(n) = (1 + 4)"*1, as 
we know, have the number e as their limit in the base n — oo, n EN. 

By Theorem 4 on the limit of a composite function, we can now assert that the 
functions 


1 [x] 1 [x] 
(ow) = (14755) : @on=(1+555) ; 


1 [x]+1 
(g20 f) = (1+) 
[x] 


also have e as their limit over the base x > ++oo. 
It now remains for us only to remark that 


1 [x] 1\* 1 [x]+1 
(+5) <(1+2) <(1+55) 


for x > 1. Since the extreme terms here tend to e as x — +00, it follows from 
Theorem 3 on the properties of a limit that lim,—, +5.(1 + +) =e. 


Using Theorem 5 on the limit of a composite function, we now show that 
limy——oo(1 + +)* =e. 
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Proof We write 


iy? fh, SE it 
lim (1 + -) = lim (: + =) = lim (1 — *) = 
X—>—0O x (—t)>—00 (-t t—>+0o t 


| 
i 
ae 
a, 
+ 
t= 
=o 
| 


ie ae 1 
= lim (1+) lim (1+)- 
t—>-+00 t= t— +00 f-— ] 
1 t—1 1 u 
lim (1+) = lim (1+2) =e. 
to+00 f= | u—-+oo u 


When we take account of the substitutions u = t — 1 and t = —x, these equalities 
can be justified in reverse order (!) using Theorem 5. Indeed, only after we have 
arrived at the limit limy—+oo(1 + iy, whose existence has already been proved, 
does the theorem allow us to assert that the preceding limit also exists and has the 
same value. Then the limit before that one also exists, and by a finite number of such 
transitions we finally arrive at the original limit. This is a very typical example of the 
procedure for using the theorem on the limit of a composite function in computing 
limits. 

Thus, we have 


It follows that lim, ..(1 + 1)" =e. Indeed, let ¢ > 0 be given. 

Since limy-, —59(1 + 1)x =e, there exists c; € R such that |(1 + iy —e| <e for 
X<Cj. 

Since limy-++o00(1 + iy =e, there exists cp € R such that |(1 + +)" —e| <e for 
C2 <X. 

Then for |x| > c = max{|cj|, |c2|} we have |(1+ +) —e| <e, which verifies that 


lim,o0(1 + 4)* =e. 


Example 21 We shall show that 


limd +1)! =e. 
t>0 


Proof After the substitution x = 1/t, we return to the limit considered in the pre- 
ceding example. 


Example 22 
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Proof We know (see Example 11 in Sect. 3.1) that lim,_, 7 =Oifg>1. 
Now, as in Example 20, we can consider the auxiliary mapping f :R, > N 
given by the function [x] (the integer part of x). Using the inequalities 


1 [x] x [x] +1 
qd ; qh) = q* = gh! a 


and taking account of the theorem on the limit of a composite function, we find that 
the extreme terms here tend to 0 as x — +00. We conclude that lim,_, + ra =0. 


Example 23 


: log, x 
lim 8a * = 
x—-+00 x 


Proof Let a > 1. Set t = log, x, so that x = a‘. From the properties of the expo- 
nential function and the logarithm (taking account of the unboundedness of a” for 
n €N) we have (x > +00) > (t ~ +00). Using the theorem on the limit of a 
composite function and the result of Example 22, we obtain 


log, x 


im 
x>+00 Xx t++00 q! 


If 0<a <1 we set —t =log,x,x =a‘. Then (x > +00) © (t > +00), and 
since |/a > 1, we again have 


= = lim 
x>+00 Xx t>+00 q7! t>-+o0 (1/a)! 


c. The Limit of a Monotonic Function 


We now consider a special class of numerical-valued functions, but one that is very 
useful, namely the monotonic functions. 


Definition 17 A function f : E > R defined ona set E C Ris said to be 


increasing on E if 

Vx1,%2 € E (x1 <x2 => f(x1) < f(%2)); 
nondecreasing on E if 

Wx1,x2 € E (x1 <x2=> f(a) < f(a); 
nonincreasing on E if 


Vx1,x2 € E (x1 <x2 => f(x1) = f (x2); 
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decreasing on E if 
Wx1,%2 € E (x1 <x2 => f(x) > f (2). 


Functions of the types just listed are said to be monotonic on the set E. 

Assume that the numbers (or symbols —oo or +00) i = inf E and s = sup E are 
limit points of the set EF, and let f : E — R be a monotonic function on E. Then 
the following theorem holds. 


Theorem 6 (Criterion for the existence of a limit of a monotonic function) A neces- 
sary and sufficient condition for a function f : E — R that is nondecreasing on the 
set E to have a limit as x > s,x € E, is that it be bounded above. For this function 
to have a limit as x > i, x € E, it is necessary and sufficient that it be bounded 
below. 


Proof We shall prove this theorem for the limit limgs,—+; f(x). 

If this limit exists, then, like any function having a limit, the function f is ulti- 
mately bounded over the base E 3.x > s. 

Since f is nondecreasing on E, it follows that f is bounded above. In fact, we 
can even assert that f(x) < limgsx—5 f(x). That will be clear from what follows. 

Let us pass to the proof of the existence of the limit limgs,.; f(x) when f is 
bounded above. 

Given that f is bounded above, we see that there is a least upper bound of the 
values that the function assumes on E \ {s}. Let A = sup, E\{s} f(x). We shall 
show that limgs;-+; f(x) = A. Given ¢ > 0, we use the definition of the least upper 
bound to find a point xo € E \ {s} for which A — ¢ < f (xo) < A. Then, since f 
is nondecreasing on E, we have A — € < f(x) < A for x9 < x € E \ {s}. But the 
set {x € E | x9 < x <5} is obviously an element of the base x — s, x € E (since 
s = sup E). Thus we have proved that limrs;-+5 f(x) = A. 

For the limit limgs,-.; f(x) the reasoning is analogous. In this case we have 
limgsxsi f(x) = infrex\ ti; f(x). 


d. Comparison of the Asymptotic Behavior of Functions 


We begin this discussion with some examples to clarify the subject. 

Let 2(x) be the number of primes not larger than a given number x € R. Al- 
though for any fixed x we can find (if only by explicit enumeration) the value of 
w(x), we are nevertheless not in a position to say, for example, how the function 
w(x) behaves as x —> +00, or, what is the same, what the asymptotic law of distribu- 
tion of prime numbers is. We have known since the time of Euclid that 2 (x) — +00 
as x —> +00, but the proof that (x) grows approximately like =~ was achieved 
only in the nineteenth century by P.L. Chebyshev.!? 


'3PL. Chebyshev (1821-1894) — outstanding Russian mathematician and specialist in theoretical 
mechanics, the founder of a large mathematical school in Russia. 
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When it becomes necessary to describe the behavior of a function near some 
point (or near infinity) at which, as a rule, the function itself is not defined, we say 
that we are interested in the asymptotics or asymptotic behavior of the function in a 
neighborhood of the point. 

The asymptotic behavior of a function is usually characterized using a second 
function that is simpler or better studied and which reproduces the values of the 
function being studied in a neighborhood of the point in question with small relative 
error. 

Thus, as x — +00, the function 2(x) behaves like ore as x — 0, the function 


sine behaves like the constant function 1. When we speak of the behavior of the 
: 
like x*, while in speaking of its behavior as x — 0, we shall say it behaves like 
sin i. 

We now give precise definitions of some elementary concepts involving the 
asymptotic behavior of functions. We shall make systematic use of these concepts 


at the very first stage of our study of analysis. 


function x? + x + sin+ as x — 00, we shall obviously say that it behaves basically 


Definition 18 We shall say that a certain property of functions or a certain relation 
between functions holds ultimately over a given base B if there exists B € B on 
which it holds. 


We have already interpreted the notion of a function that is ultimately constant 
or ultimately bounded in a given base in this sense. In the same sense we shall say 
from now on that the relation f(x) = g(x)h(x) holds ultimately between functions 
Jf, g, and h. These functions may have at the outset different domains of definition, 
but if we are interested in their asymptotic behavior over the base B, all that matters 
to us is that they are all defined on some element of B. 


Definition 19 The function f is said to be infinitesimal compared with the function 
g over the base 4, and we write f =g 0(g) or f = 0(g) over Bif the relation f(x) = 
a(x)g(x) holds ultimately over the 6, where a(x) is a function that is infinitesimal 
over B. 


2 


Example 24 x? = 0(x) as x > 0, since x Kee, 


1 2 
x * . 


Example 25 x = 0(x*) as x — 00, since ultimately (as long as x £0), x = 

From these examples one must conclude that it is absolutely necessary to indicate 
the base over which f = o(g). 

The notation f = o(g) is read “f is little-oh of g”. 

It follows from the definition, in particular, that the notation f =z 0(1), which 
results when g(x) = 1, means simply that f is infinitesimal over B. 


Definition 20 If f =, 0(g) and g is itself infinitesimal over B, we say that f is an 
infinitesimal of higher order than g over B. 
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Example 26 x~? = + is an infinitesimal of higher order than x~! = i as x > Oo. 


Definition 21 A function that tends to infinity over a given base is said to be an 
infinite function or simply an infinity over the given base. 


Definition 22 If f and g are infinite functions over 6 and f =go(g), we say that 
g is a higher order infinity than f over B. 


Example 27 1_, asx > 0,4 —- wasx > Oand 1 =0( ae Therefore 4 is 
x xe x x x 


a higher order infinity than ‘ asx —> 0. 


2 


At the same time, as x — 00, x* is a higher order infinity than x. 


It should not be thought that we can characterize the order of every infinity or 
infinitesimal by choosing some power x” and saying that it is of order n. 


Example 28 We shall show that fora > 1 and anyne Z 


lim —=0, 
x—-+00 q* 


that is, x” = o(a*) as x > +00. 


Proof If n < 0 the assertion is obvious. If n € N, then, setting g = 2/a, we have 


n 
q> land = ()", and therefore 
x” x \" 
lim —= lim {|—]) = lm — lim —=0 
x—>+00 q* x>+00\ q* x+>+00 q* x+>+00 q* 
n factors 


We have used (with induction) the theorem on the limit of a product and the result 
of Example 22. 


Thus, for any n € Z we obtain x” = o(a*) asx > +00 ifa>1. 


Example 29 Extending the preceding example, let us show that 
lim —=0 
x—>+00 q* 


for a > 1 and any a ER, that is, x* = o(a*) as x > +00. 


Proof Indeed, let us choose n € N such that n > a. Then for x > 1 we obtain 


Using properties of the limit and the result of the preceding example, we find that 


lim, +00 = = 0. 
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Example 30 Let us show that 


=1/x 
: a 
lim =0 
Rysx> 0 x” 


for a > 1 and any a € R, that is, a*/* = o(x")asx > 0,x€ Ry. 


Proof Setting x = —1/t in this case and using the theorem on the limit of a com- 
posite function and the result of the preceding example, we find 


qoi/* rid 


lim = lim —=0. 
Ryax>0 x” t>+o0 q! 


Example 31 Let us show that 


for a > 0, that is, for any positive exponent a we have log, x = o(x*) as x — +00. 


Proof If a > 1, we set x = a‘/“. Then by the properties of power functions and 
the logarithm, the theorem on the limit of a composite function, and the result of 
Example 29, we find 


it t 1 t 
NOB a ding lis 


li = 0. 
x—>+00 x@ t>+oo ql @ t>+00 q! 
If 0 <a <1, then 1/a > 1, and after the substitution x = a~’ /© we obtain 
_ log, x .. (t/a) Was t 
li —— = lim = im = 
x>+00 x% t>+00 at a t>+o0 (1/a)! 


Example 32 Let us show further that 
x“log,x =o(1) asx—>0, x ER, 
for any a > 0. 


Proof We need to show that limp, 5, >9x* log, x = 0 for a > 0. Setting x = 1/t 
and applying the theorem on the limit of a composite function and the result of the 
preceding example, we find 


log, t ay 


] 1/t 
ine, igo wes ti Pe Se ie 
Ryax>0 t+ +00 t@ t>+oo [% 


Definition 23 Let us agree that the notation f =g3 O(g) or f = O(g) over the base 
B (read “f is big-oh of g over B’’) means that the relation f(x) = B(x)g(x) holds 
ultimately over 6 where 6 (x) is ultimately bounded over B. 
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In particular f =p O(1) means that the function f is ultimately bounded over B. 
Example 33 (4 + sinx)x = O(x) asx > o. 


Definition 24 The functions f and g are of the same order over B, and we write 
f = g over B, if f =p O(g) and g=g O(f) simultaneously. 


Example 34 The functions (2 + sinx)x and x are of the same order as x — 00, but 
(1 + sinx)x and x are not of the same order as x — oo. 


The condition that f and g be of the same order over the base B is obviously 
equivalent to the condition that there exist c; > 0 and cz > 0 and an element B € B 
such that the relations 


ci|g(x)| <|f(@)| < c2|¢(x)| 


hold on B, or, what is the same, 
1 1 
—|f(x)| < |g@)| < —|f@)|. 
C2 Cl 


Definition 25 If the relation f(x) = y(x)g(x) holds ultimately over B where 
limg y (x) = 1, we say that the function f behaves asymptotically like g over B, 
or, more briefly, that f is equivalent to g over B. 


In this case we shall write f ~g g or f ~ g over B. 
The use of the word equivalent is justified by the relations 


(Fes); 
(fxs) => (e~/). 
(438) 4 (eg%) > (¥34) 


Indeed, the relation f ~g f is obvious, since in this case y(x) = 1. Next, if 
limg y (x) = 1, then limg say = | and g(x) = a f (x). Here all we that need to 
explain is why it is permissible to assume that y(x) 4 0. If the relation f(x) = 
y(x)g(x) holds on B, € B, and 5 <|y(x)| < on Bo € BL, then we can take Be B 
with B C By, Bo, on which both relations hold. Outside of B, if convenient, we 
may assume that y (x) = 1. Thus we do indeed have (f ~ g) > (g ~ f). 

Finally, if f(x) = v1 (x) g(x) on B; € B and g(x) = y2(x)A(x) on Bo € B, then 
on an element B € B such that B C B, N Bo, both of these relations hold simulta- 
neously, and so f(x) = yi(x)y2(x)A(x) on B. But limg 11 (x) y2(x) = limg yj (x) - 
limg y2(x) = 1, and hence we have verified that f ~gh. 
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It is useful to note that since the relation limg y (x) = | is equivalent to y(x) = 
1+ a(x), where limg a(x) = 0, the relation f ~g g is equivalent to f(x) = g(x) + 
a(x)g(x) = g(x) + 0(g(x)) over B. . 

We see that the relative error |a(x)| = | eg | in approximating f(x) by a 
function g(x) that is equivalent to f(x) over G is infinitesimal over B. 

Let us now consider some examples. 


Example 35 x74+x=(14+ 4 )x? ~ x2 as x > 00. 
The absolute value of the difference of these functions 


I(x? +2) — 2°] = [| 
tends to infinity. However, the relative error nt = un that results from replacing 


x? +x by the equivalent function x? tends to zero as x > 00. 


Example 36 At the beginning of this discussion we spoke of the famous asymptotic 
law of distribution of the prime numbers. We can now give a precise statement of 
this law: 


x x 
u(x) = — +0| — as x > +00. 
Inx Inx 


Example 37 Since limy-.0 “ = 1, we have sinx ~ x as x — O, which can also be 


written as sinx =x +o0(x) asx —> 0. 
Example 38 Let us show that In(1+x)~x asx — 0. 


Proof 


. Md+x) 
lim. ————— 
x 


x0 


= lim In(l +x)!" = In( im (1 +x)!/*) ie S41) 
x0 x0 


Here we have used the relation log, (b*) = a log, b in the first equality and the 
relation lim;_,p log, t = log, b = log, (lim;_,p ¢) in the second. 


Thus, In(1 + x) =x + o0(x) asx > 0. 
Example 39 Let us show that e* = 1+ x +0(x) asx > 0. 


Proof 


x 


lim = lim =1. 

x>0 Xx z>0In(1 + 1£) 

Here we have made the substitution x = In(1 +t), e* — 1 =¢ and used the relations 
e* > e? = 1 as x > Oande* £1 for x 40. Thus, using the theorem on the limit of 
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a composite function and the result of the preceding example, we have proved the 
assertion. 


Thus, eX —l1~xasx—- 0. 


Example 40 Let us show that (1 + x)* =1+ax+o(x) asx —> 0. 


Proof 
— G+x)*-1 | eit) 1 a@In(1 +x) 
lim = lim . = 
x0 x x>0 aln(1 +x) x 
~ e-1 . Indit+x) 
=a lim - lim =a 
t>0 t x0 x 


In this computation, assuming a 4 0, we made the substitution a In(1 + x) =¢ and 
used the results of the two preceding examples. 
If w = 0, the assertion is obvious. 


Thus, (l+x«)* —Il~axasx—> 0. 
The following simple fact is sometimes useful in computing limits. 


Proposition 3 If f ~g f, then limg f(x)g(x) = limg f (x)g(x), provided one of 
these limits exists. 


Proof Indeed, given that f(x) = y (x) f (x) and limg y (x) = 1, we have 


lim f@)g@) = lim y (x) f@)g@) = limy (x) - lim fx)g(x) = lim f(x)g(x). 


Example 41 
Incos x 1. Incos?x 1... Ind—- sin? x) 
im — = im =- lim — 
x0 sin(x?) 2 x30 x2 2 x0 x? 
1 — sin? x oe 1 
=- hm = lim = 
2 x30 x2 2x 30 x2 2 


Here we have used the relations In(1 + a) ~ a as a > 0, sinx ~ x as x > 0,7 
as ~ a as B — 0, and sin? x ~ x? as x > 0. 

We have proved that one may replace functions by other functions equivalent to 
them in a given base when computing limits of monomials. This rule should not be 


extended to sums and differences of functions. 


Example 42 ./x*2 +x ~x as x > +00, but 


lim (Vx? +x — x) A lim (x-x)=0. 
x +00 


X—> +00 
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In fact, 
ee oe ee en re eee 
dim (vx? +x A) yl Piss. i¢i41 2 
x 


We note one more widely used rule for handling the symbols o(-) and O(-) in 
analysis. 


Proposition 4 For a given base 


a) o(f) +o(f) =o(f); 
b) o(f) is also O(f); 

c) off) + O(f) = O(f); 
d) O(f) + O(f) = O(f); 


( Ae ” 
e) if g(x) £0, then “LE” — o( LB) and SES) = (£3). 


Notice some peculiarities of operations with the symbols o(-) and O(-) that 
follow from the meaning of these symbols. For example 20(f) = o(f) and 
o(f) + O(f) = O(f) (even though in general o( f) 4 0); also, o(f) = O(f), but 
O(f) £ o(f). Here the equality sign is used in the sense of “is”. The symbols o(-) 
and O(-) do not really denote a function, but rather indicate its asymptotic behavior, 
a behavior that many functions may have simultaneously, for example, f and 2f, 
and the like. 


Proof a) After the clarification just given, this assertion ceases to appear strange. 
The first symbol o(f) in it denotes a function of the form a(x) f(x), where 
limp a; (x) = 0. The second symbol o(f), which one can (or should) equip with 
some mark to distinguish it from the first, denotes a function of the form a2(x) f (x), 
where limg a2(x) = 0. Then on (x) f(x) + a2(x) f(x) = @1(x) + a2(0)) f) = 
a3(x) f (x), where limg a3 (x) = 0. 

Assertion b) follows from the fact that any function having a limit is ultimately 
bounded. 

Assertion c) follows from b) and d). 

Assertion d) follows from the fact that the sum of ultimately bounded functions 
is ultimately bounded. 

As for e), we have von = ome = ox(x) re = 0( 

The second part of assertion e) is verified similarly. 


f(x) 
g(x) ). 


Using these rules and the equivalences obtained in Example 40, we can now find 
the limit in Example 42 by the following direct method: 


| 1 
lim (Vx27+x—x)= lim +( 1+2-1)= 
X—>+00 X—>+00 x 
: 1 1 1 
lim x{1+z=-—+o0(-—]-1)= 
x—+00 2 x x 
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lim (; +x- o(- ))= 
X—>+00 

li : 1))= : 
im, (5 + 0( )) = > 


We shall soon prove the following important relations, which should be memo- 
rized at this point like the multiplication table: 


1 1 L 
e* fal+pst+ ae +. er mo... forx ER, 
cos ae i tied 2k... forxeR 
see a Tua Qk! ~ ee 
1 1 
sinx = —x — —x? +. $4 ant forx ER, 
3! (2k +1)! 
1 1 =|) 
Ind +x) =x Sat gah pep Oa for |x| <1, 
n 
a(a —1 
Gasears pe = oe 
1! 2! 
Henao 1 
go) = aula 8 for |x| <1. 
nN: 


On the one hand, these relations can already be used as computational formulas, and 
on the other hand they contain the following asymptotic formulas, which generalize 
the formulas contained in Examples 37-40: 


1 1 1 
+1 
ejaltoxtye te + at + Ol" ) as x > 0, 
1 1 1 
cosx =1— 5 x” + Gx" +. SB E44 O(x74*?) asx > 0, 
1 1 -1 
sine = fx — Bad poo PDT 4 (0h) asx > 0, 
1 2 1 3 (-1)" n n+1 
In{h xe =e a. 4+.--+ ——x + O(x ) as x > 0, 
n 
a(a—1 
(4x a4 94 OO 4g 
1! 2! 
—1 _ 1 
ee " = ates x” + O(x"*1) asx > 0. 
n! 


These formulas are usually the most efficient method of finding the limits of the 
elementary functions. When doing so, it is useful to keep in mind that O(x’"*!) = 
x™t1. O(1) =x™-xO(1) =x™o(1) = 0(x”) as x > 0. 

In conclusion, let us consider a few examples showing these formulas in action. 
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Example 43 
x —sinx x—(x— 7x3 + O(2)) I 0 I 
lim ——.—— = lim ; = lim —+ O(x ) =: 
x>0 x3 x0 x3 x 0\ 3! 3! 


Example 44 Let us find 


As x > 00 we have: 


xetx 1ltx7 ig ies 
l+x3 14x37 x2 x3 


(+ 2)0-2+(z) 


from which we obtain 


fxe+x 1 9 1 1 
ra PRS aye Fo 3 as x > OO. 


Hence the required limit is 


Example 45 


snl (e2)T-mmenllo(2) -3]- 
oor |= 


Jim aie: mt o(=)) | = 


= lim exp 


jea(Iee* 


lim exp 
xX> CO 
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3.2.5 Problems and Exercises 


1. a) Prove that there exists a unique function defined on R and satisfying the fol- 
lowing conditions: 


fQ=a (a>0,aFl), 
fi): f 2) = f(x1 +22), 
t(x) > f(xo) asx x0. 


b) Prove that there exists a unique function defined on Ry and satisfying the 
following conditions: 


f@=1 (a>0,a41), 
f (x1) + f2) = fr - x2), 


f(x) > fo) forxp eR, and R, 3x — xo. 


Hint: Look again at the construction of the exponential function and logarithm dis- 
cussed in Example 10. 


2. a) Establish a one-to-one correspondence g : R > R + such that g(x + y) = 
y(x)- g(y) for any x, y € R, that is, so that the operation of multiplication in the 
image (R) corresponds to the operation of addition in the pre-image (R). The 
existence of such a mapping means that the groups (R, +) and (R4, -) are identical 
as algebraic objects, or, as we say, they are isomorphic. 

b) Prove that the groups (R, +) and (R\0, -) are not isomorphic. 


3. Find the following limits. 


a) limy_,+49x* 


b) limy++00% roa 
log, (1+ x) , 
—— 5 @ 


c) limy—+o 
d) limy 9 S=!, 
4. Show that 


1 1 
Teh eee Ine ec O01) asn— oO, 
n 


where c is a constant. (The number c = 0.57721... is called Euler’s constant.) 
Hint: One can use the relation 


n+1 1 1 1 
In =Inf{1+—-—]=-+0 5 asn—> oo. 
n n n n 


5. Show that 
a) if two series pene dy and peer , Pn With positive terms are such that ay ~ by 
as n —> oo, then the two series either both converge or both diverge; 
: Co ol 
b) the series }°7~, sin => converges only for p > 1. 
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6. Show that 


a) if ad, > an+1 > 0 for all n € N and the series Sea a, converges, then ay, = 
o(+) asn—> oo; 

b) ifb, = o(+), one can always construct a convergent series yy dn such that 
by, = 0(dy) asn > CO; 


c) if a series pa ay With positive terms converges, then the series b pie 1 An, 


where A, = pen ap — Foe ay also converges, and dy = 0(An) asin > 00; 


d) if a series }°°° , a, with positive terms diverges, then the series }°°°., An, 


where A, = ge an — os ag also diverges, and Ay, = 0(dyn) as n > 0. 

It follows from c) and d) that no convergent (resp. divergent) series can serve as 
a universal standard of comparison to establish the convergence (resp. divergence) 
of other series. 


7. Show that 


a) the series ar Indy, where a, > 0, n € N, converges if and only if the se- 
quence {IT = a) ---d,} has a finite nonzero limit. 
b) the series yy In(1 + a@,), where |a,| < 1, converges absolutely if and only 


if the series )°°° | a converges absolutely. 


Hint: See part a) of Exercise 5. 


8. An infinite product []7° | ex is said to converge if the sequence of numbers 7, = 
Tie ex has a finite nonzero limit J7. We then set JT = [|< ex. 
Show that 


a) if an infinite product Th én converges, then e, > 1 asn > cw; 

b) if Vn € N (e, > 0), then the infinite product []7< , én converges if and only if 
the series ead , Inéy converges; 

c) if e, = 1+ a, and the a, are all of the same sign, then the infinite product 


T1721. + an) converges if and only if the series }“P°_, a» converges. 
9. a) Find the product Thr (1x71), 


b) Find []}~, cos 37 and prove the following theorem of Viate!* 


uw 1 
z= 
I Lat fl Lyd fia) Fl 
Vb s+dy/3- 2t3y beth... 


c) Find the function f(x) if 


f@)=1, 


'4F vViate (1540-1603) — French mathematician, one of the creators of modern symbolic algebra. 
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f (2x) =cos” x - f (x), 
fQx)— fO) asx 0. 


Hint: x = 2-35. 
10. Show that 

a) if i =1+6,,n=1,2,..., and the series b Saaer, Bn converges absolutely, 
then the limit limy_. 09 by = b € R exists; 

b) if ae =1+2+a,,n=1,2,..., and the series }°”° | a, converges abso- 


lutely, then a, ~ <5 as n —> 00; 

c) if the series )~°° | a, is such that aa = 1+ +a, and the series )°> | dn 
converges absolutely, then )°°° ; an converges absolutely for p > 1 and diverges 
for p < 1 (Gauss’ test for absolute convergence of a series). 


11. Show that 
eee 62 | ut 
n—>oo an 


for any sequence {a,} with positive terms, and that this estimate cannot be improved. 


Chapter 4 
Continuous Functions 


4.1 Basic Definitions and Examples 


4.1.1 Continuity of a Function at a Point 


Let f be a real-valued function defined in a neighborhood of a point a € R. In 
intuitive terms the function f is continuous at a if its value f(x) approaches the 
value f(a) that it assumes at the point a itself as x gets nearer to a. 

We shall now make this description of the concept of continuity of a function at 
a point precise. 


Definition 0 A function f is continuous at the point a if for any neighborhood 
V(f(a)) of its value f(a) at a there is a neighborhood U(a) of a whose image 
under the mapping f is contained in V(f(a)). 


We now give the expression of this concept in logical symbolism, along with two 
other versions of it that are frequently used in analysis. 


(f is continuous at a) := (VV(f(a)) SU) (f(U@) C V(f@))), 
Ve > 0 3U(a) Wx € U(a) (|f (&) — f@| <e), 
Ve > 055 > 0 Vx ER (|x —a| <d > | f(x) — f@| <e). 


The equivalence of these statements for real-valued functions follows from the 
fact (already noted several times) that any neighborhood of a point contains a sym- 
metric neighborhood of the point. 

For example, if for any e-neighborhood V*(f(a)) of f(a) one can choose 
a neighborhood U(a) of a such that Vx € U(a) (|f(x) — f(a)| < e), that is, 
f(U(a)) c V*(f(q@), then for any neighborhood V(f(a)) one can also choose a 
corresponding neighborhood of a. Indeed, it suffices first to take an e-neighborhood 
of f(a) with V°(f(a)) C VCf(a)), and then find U (a) corresponding to V*(f (a)). 
Then f (U(a)) © V°(f(a)) C V(f(@). 
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Thus, if a function is continuous at a in the sense of the second of these defini- 
tions, it is also continuous at a in the sense of the original definition. The converse 
is obvious, so that the equivalence of the two statements is established. 

We leave the rest of the verification to the reader. 

To avoid being distracted from the basic concept being defined, that of continuity 
at a point, we assumed for simplicity to begin with that the function f was defined 
in a whole neighborhood of a. We now consider the general case. 

Let f : E > R be a real-valued function defined on some set E C R andaa 
point of the domain of definition of the function. 


Definition 1 A function f : E > R is continuous at the point a € E if for every 
neighborhood V (f (a)) of the value f(a) that the function assumes at a there exists 
a neighborhood Uf (a) of a in E! whose image f (Ug (a)) is contained in V(f (a)). 


Thus 


(f : E — Ris continuous at a € E) := 
=(W (f@) Ue@ (f(UeE@) CV(F@))). 


Of course, Definition 1 can also be written in the e—d-form discussed above. Where 
numerical estimates are needed, this will be useful, and even necessary. 
We now write these versions of Definition 1. 


(f : E — Ris continuous at a € E) := 
= (Ve > 0 dU g(a) Vx € Usa) (|) — f@| <e)), 
or 


(f : E > Ris continuous at a € E):= 
= (Ve > 055 > Ox € E (|x —a| <5 > | f(x) — f(@| <e)). 


We now discuss in detail the concept of continuity of a function at a point. 

1° If a is an isolated point, that is, not a limit point of E, there is a neighbor- 
hood U (a) of a containing no points of E except a itself. In this case Ug(a) =a, 
and therefore f(Ug(a)) = f(a) C V(f(a)) for any neighborhood V(f(a)). Thus 
a function is obviously continuous at any isolated point of its domain of definition. 
This, however, is a degenerate case. 

2° The substantive part of the concept of continuity thus involves the case when 
a € E anda is a limit point of E. It is clear from Definition | that 


(f : E > Ris continuous at a € E, where a is a limit point of E) > 


o (<i, 09 s00) 


1We recall that Ug (a) = ENU(a). 
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Proof In fact, if a is a limit point of EZ, then the base E 5 x — a of deleted neigh- 
borhoods Ug (a) = Ug(a)\a of a is defined. 

If f is continuous at a, then, by finding a neighborhood Ug (a) for the neigh- 
borhood V(f(a)) such that f(Ug(a)) C V( f(a), we will simultaneously have 
f(g (a)) C V(f (a)). By definition of limit, therefore, limgs,+¢ f(x) = f(a). 

Conversely, if we know that limgs;a f (x) = = f(a), then, given a neighborhood 
V(f(a)), we find a deleted neighborhood Ur (a) such that f(UE (a)) C V(f(a)). 
But since f(a) € V(f(a)), we then have also f(Ug(a)) C V(f(a)). By Defini- 
tion | this means that f is continuous ata € E. 


3° Since the relation lim Esx—a f (x) = f(a) can be rewritten as 


lim f@)=f(,,lim 2). 


we now arrive at the useful conclusion that the continuous functions (operations) 
and only the continuous ones commute with the operation of passing to the limit at 
a point. This means that the number f(a) obtained by carrying out the operation f 
on the number a can be approximated as closely as desired by the values obtained 
by carrying out the operation f on values of x that approximate a with suitable 
accuracy. 

4° If we remark that for a € E the neighborhoods Ug(a) of a form a base By, 
(whether a is a limit point or an isolated point of E), we see that Definition | of 
continuity of a function at the point a is the same as the definition of the statement 
that the number f(a) — the value of the function at a — is the limit of the function 
over this base, that is 


(f : E > Ris continuous at a € E) > (lim f@m= f(a)). 


5° We remark, however, that if limp, f(x) exists, since a ¢ Ug(a) for every 
neighborhood Ug (a), it follows that this limit must necessarily be f(a). 

Thus, continuity of a function f : E > R at a point a € E is equivalent to the 
existence of the limit of this function over the base B, of neighborhoods (not deleted 
neighborhoods) Ug(a) of ae€ E. 

Thus 


(f : E > Ris continuous ata € E) © (3 pin f(x)). 


6° By the Cauchy criterion for the existence of a limit, we can now say that a 
function is continuous at a point a € E if and only if for every ¢ > O there exists a 
neighborhood U¢ (a) of a in E on which the oscillation w(f; Ug (a)) of the function 
is less than e. 


Definition 2 The quantity w(f; a) = lims.+0 @(f; U%(a)) (where U2 (a) is the 
5-neighborhood of a in E) is called the oscillation of f : E > Rata. 


152 4 Continuous Functions 


Formally the symbol w(f; X) has already been taken; it denotes the oscillation 
of the function on the set X. However, we shall never consider the oscillation of a 
function on a set consisting of a single point (it would obviously be zero); therefore 
the symbol w(f; a), where a is a point, will always denote the concept of oscillation 
at a point just defined in Definition 2. 

The oscillation of a function on a subset of a set does not exceed its oscillation 
on the set itself, so that w(f; ue (a)) is a nondecreasing function of 6. Since it is 
nonnegative, either it has a finite limit as 6 > +0, or else w(f; Ue (a)) = +00 for 
every 6 > 0. In the latter case we naturally set w(f; a) = +00. 

7° Using Definition 2 we can summarize what was said in 6° as follows: a func- 
tion is continuous at a point if and only if its oscillation at that point is zero. Let us 
make this explicit: 


(f : E > Ris continuous ata € E) © (w(f;a) =0). 


Definition 3 A function f : E > R is continuous on the set E if it is continuous at 
each point of E. 


The set of all continuous real-valued functions defined on a set E will be denoted 
C(E; R) or, more, C(E). 

We have now discussed the concept of continuity of a function. Let us consider 
some examples. 


Example I If f : E — R is a constant function, then f € C(£). This is obvious, 
since f(E) =c C V(c), for any neighborhood V(c) of cE R. 


Example 2 The function f(x) = x is continuous on R. Indeed, for any point x9 € R 
we have | f(x) — f(xo)| = |x — xo| < € provided |x — x9| <d =e. 


Example 3 The function f(x) = sinx is continuous on R. 
In fact, for any point xg € R we have 


X+X0 . X—XO 
sin 


|sinx — sinxo| = |2 cos 


= |x — xol <e, 


provided |x —xo| <éd=e. 
Here we have used the inequality | sinx| < |x| proved in Example 9 of Para- 
graph d) of Sect. 3.2.2. 


Example 4 The function f(x) = cos.x is continuous on R. 
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Indeed, as in the preceding example, for any point x9 € R we have 


X+X0 . X—XO0 


sin 
2 2 


|cosx — cos xo| = | 2sin 


<2 


sin 


< |x —xo| <, 


provided |x —xo| <éd=e. 


Example 5 The function f(x) = a* is continuous on R. 
Indeed by property 3) of the exponential function (see Paragraph d in Sect. 3.2.2, 
Example 10a), at any point xo € R we have 


lim a* =a™, 
x—>Xx9 


which, as we now know, is equivalent to the continuity of the function a* at the 
point xo. 


Example 6 The function f(x) = log, x is continuous at any point xo in its domain 
of definition Ry = {x € R| x > 0}. 

In fact, by property 3) of the logarithm (see Paragraph d in Sect. 3.2.2, Example 
10b), at each point x9 € Ry we have 


lim log, x = log, xo, 
R,Sx—>x0 


which is equivalent to the continuity of the function log, x at the point xo. 
Now, given € > 0, let us try to find a neighborhood Up, (x0) of the point xo so as 
to have 


|log, x — log, xo| < € 


at each point x € Up, (x0). 
This inequality is equivalent to the relations 


x 
—é < log, — <e. 
X0 
For definiteness assume a > 1; then these last relations are equivalent to 
xoa&§ <x <xoa*. 
The open interval |xga~*, xga*[ is the neighborhood of the point x9 that we 


are seeking. It is useful to note that this neighborhood depends on both ¢ and the 
point xo, a phenomenon that did not occur in Examples 1-4. 
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Example 7 Any sequence f : N > R is a function that is continuous on the set N 
of natural numbers, since each point of N is isolated. 


4.1.2 Points of Discontinuity 


To improve our mastery of the concept of continuity, we shall explain what happens 
to a function in a neighborhood of a point where it is not continuous. 


Definition 4 If the function f : E — R is not continuous at a point of E, this point 
is called a point of discontinuity or simply a discontinuity of f. 


By constructing the negation of the statement “the function f : E — R is contin- 
uous at the point a € E”’, we obtain the following expression of the definition of the 
statement that a is a point of discontinuity of /: 


(a € E is a point of discontinuity of f) := 
= (AV(f(@) WWe(a) 3x € Ue@ (f(@) ¢ V(F@)))- 


In other words, a € E is a point of discontinuity of the function f : E > R if 
there is a neighborhood V(f(a)) of the value f(a) that the function assumes at a 
such that in any neighborhood Ug (a) of a in E there is a point x whose image is 
not in V(f(a)). 

In e—d-form, this definition has the following appearance: 


de > OVS > 04x € E (|x —a| <8 A|f (x) — f(@| =e). 
Let us consider some examples. 


Example 8 The function f(x) = sgnx is constant and hence continuous in the 
neighborhood of any point a € R that is different from 0. But in any neighbor- 
hood of 0 its oscillation equals 2. Hence 0 is a point of discontinuity for sgnx. We 
remark that this function has a left-hand limit lim,_,_9 sgnx = —1 and a right-hand 
limit lim,_,+49sgnx = 1. However, in the first place, these limits are not the same; 
and in the second place, neither of them is equal to the value of sgn x at the point 0, 
namely sgn0 = 0. This is a direct verification that 0 is a point of discontinuity for 
this function. 


Example 9 The function f (x) = | sgnx| has the limit lim,_,9 | sgn.x| = 1 as x > 0, 
but f (0) = |sgn0| = 0, so that lim,.9 f(x) € f(O), and 0 is therefore a point of 
discontinuity of the function. 

We remark, however, that in this case, if we were to change the value of the 
function at the point 0 and set it equal to 1 there, we would obtain a function that is 
continuous at 0, that is, we would remove the discontinuity. 
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Fig. 4.1 


Definition 5 If a point of discontinuity a € E of the function f : E > R is such 
that there exists a continuous function f : E > R such that f|e\a = flz\a, then a 
is called a removable discontinuity of the function /f. 


Thus a removable discontinuity is characterized by the fact that the limit 
limgsx—+a f(x) =A exists, but A ~ f(a), and it suffices to set 


f(x) forxe E,x 4a, 


A for x =a, 


f@)= 


in order to obtain a function f : E — R that is continuous at a. 
Example 10 The function 


- 
sin=, forx 40, 

— x 
a) a 


is discontinuous at 0. Moreover, it does not even have a limit as x — 0, since, as 
was shown Example 5 in Sect. 3.2.1, lim,—,9 sin + does not exist. The graph of the 


function sin i is shown in Fig. 4.1. 
Examples 8, 9 and 10 explain the following terminology. 


Definition 6 The point a € E is called a discontinuity of first kind for the function 
f : E— R if the following limits? exist: 


polim_f@=f@-0, lim | f(x) =: fa+0), 


but at least one of them is not equal to the value f(a) that the function assumes at a. 


7If a is a discontinuity, then a must be a limit point of the set E. It may happen, however, that all 
the points of E in some neighborhood of a lie on one side of a. In that case, only one of the limits 
in this definition is considered. 
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Definition 7 If a € E is a point of discontinuity of the function f : E > R and at 
least one of the two limits in Definition 6 does not exist, then a is called a disconti- 
nuity of second kind. 


Thus what is meant is that every point of discontinuity that is not a discontinuity 
of first kind is automatically a discontinuity of second kind. 
Let us present two more classical examples. 


Example 11 The function 


1, ifxeQ, 


Dix) = . 
0, ifxeR\Q, 


is called the Dirichlet function.° 


This function is discontinuous at every point, and obviously all of its discontinu- 
ities are of second kind, since in every interval there are both rational and irrational 
numbers. 


Example 12 Consider the Riemann function* 


1 ifxa™e Q, where ™ is in lowest terms, n € N, 
R(x) — n n n 


0, ifxeR\Q. 


We remark that for any point a € R, any bounded neighborhood U (a) of it, and any 
number N €N, the neighborhood U(a) contains only a finite number of rational 
numbers *,m €Z,né€N, withn < N. 

By shrinking the neighborhood, one can then assume that the denominators of 
all rational numbers in the neighborhood (except possibly for the point a itself if 
a € Q)are larger than N. Thus at any point x € U(a) we have |R(x)| < 1/N. 

We have thereby shown that 


lim R(x) =0 
xa 


at any point a € R\Q. Hence the Riemann function is continuous at any irrational 
number. At the remaining points, that is, at points x € Q, the function is discontinu- 
ous, except at the point x = 0, and all of these discontinuities are discontinuities of 
first kind. 


3P.G. Dirichlet (1805-1859) — great German mathematician, an analyst who occupied the post of 
professor ordinarius at Géttingen University after the death of Gauss in 1855. 


4B.F. Riemann (1826-1866) — outstanding German mathematician whose ground-breaking works 
laid the foundations of whole areas of modern geometry and analysis. 


4.2 Properties of Continuous Functions 157 


4.2 Properties of Continuous Functions 


4.2.1 Local Properties 


The local properties of functions are those that are determined by the behavior of 
the function in an arbitrarily small neighborhood of the point in its domain of defi- 
nition. 

Thus, the local properties themselves characterize the behavior of a function in 
any limiting relation when the argument of the function tends to the point in ques- 
tion. For example, the continuity of a function at a point of its domain of definition 
is obviously a local property. 

We shall now exhibit the main local properties of continuous functions. 


Theorem 1 Let f : E — R bea function that is continuous at the point a € E. Then 
the following statements hold. 


1° The function f : E — R is bounded in some neighborhood Ug (a) of a. 

2° If f(a) £0, then in some neighborhood Ug(a) all the values of the function 
have the same sign as f (a). 

3° If the function g : Ug (a) — R is defined in some neighborhood of a and, like f, 
is continuous at a, then the following functions are defined in some neighbor- 
hood of a and continuous at a: 


a) (f + g)(x) = f(x) + g(x), 
b) (F- 8)@) = f@)- 8), 
c) (Aa) = oe (provided g(a) #0). 


4° If the function g : Y > R is continuous at a point b € Y and f is such that 
f:E— Y, f(@ =), and f is continuous at a, then the composite function 
(g o f) is defined on E and continuous at a. 


Proof To prove this theorem it suffices to recall (see Sect. 4.1) that the continuity 
of the function f or g at a point a of its domain of definition is equivalent to the 
condition that the limit of this function exists over the base 6, of neighborhoods of 
a and is equal to the value of the function at a: limg, f(x) = f(a), limg, g(x) = 
g(a). 

Thus assertions 1°, 2°, and 3° of Theorem | follow immediately from the defi- 
nition of continuity of a function at a point and the corresponding properties of the 
limit of a function. 


The only explanation required is to verify that the ratio £ at is actually defined 


in some neighborhood Ug (a) of a. But by hypothesis g(a) ¥ 0, and by assertion 2° 
of the theorem there exists a neighborhood Uz (a) at every point of which g(x) 40, 


that is, nen is defined in Ug (a). 
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Assertion 4° of Theorem 1 is a consequence of the theorem on the limit of a 
composite function, by virtue of which 


lim(g o f)(x) = lirn g(y) = g(b) = (f(a) = (go f)@), 


which is equivalent to the continuity of (go f) ata. 

However, to apply the theorem on the limit of a composite function, we must ver- 
ify that for any element Uy (b) of the base By there exists an element Ug (a) of the 
base B, such that f(Ug(a)) C Uy (b). But in fact, if Uy (b) = YOU (b), then by def- 
inition of the continuity of f : E — Y at the point a, given a neighborhood U(b) = 
U(f (a)), there is a neighborhood Ug (a) of a in E such that f(Ug(a)) C U(f(a)). 
Since the range of f is contained in Y, we have f(Ug(a)) C YNU(f(a)) = Uy (db), 
and we have justified the application of the theorem on the limit of a composite func- 
tion. 


Example 1 An algebraic polynomial P(x) = agx”" + ajx"~! +-+++ ay is a contin- 
uous function on R. 

Indeed, it follows by induction from 3° of Theorem | that the sum and prod- 
uct of any finite number of functions that are continuous at a point are themselves 
continuous at that point. We have verified in Examples | and 2 of Sect. 4.1 that 
the constant function and the function f(x) =x are continuous on R. It then fol- 
lows that the functions ax” =a-x-...-x are continuous, and consequently the 

ae 


m factors 
polynomial P(x) is also. 


Example 2 A rational function R(x) = is is ye quotient of polynomials — is contin- 
uous wherever it is defined, that is, where Q(x) 4 0. This follows from Example 1 
and assertion 3° of Theorem 1. 


Example 3 The composition of a finite number of continuous functions is continu- 
ous at each point of its domain of definition. This follows by induction from asser- 
tion 4° of Theorem 1. For example, the function esin” (In| cos.x|) is continuous on all 
of R, except at the points 5(2k + 1), k € Z, where it is not defined. 


4.2.2 Global Properties of Continuous Functions 


A global property of a function, intuitively speaking, is a property involving the 
entire domain of definition of the function. 


Theorem 2 (The Bolzano—Cauchy intermediate-value theorem) [fa function that is 
continuous on a Closed interval assumes values with different signs at the endpoints 
of the interval, then there is a point in the interval where it assumes the value 0. 
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In logical symbols, this theorem has the following expression.> 


(f € Cla, b] A f(a)- f(b) <0) > Ac € [a,b] (f(c) =0). 


Proof Let us divide the interval [a, b] in half. If the function does not assume the 
value 0 at the point of division, then it must assume opposite values at the endpoints 
of one of the two subintervals. In that interval we proceed as we did with the original 
interval, that is, we bisect it and continue the process. 

Then either at some step we hit a point c € [a, b] where f(c) = 0, or we obtain 
a sequence {J,,} of nested closed intervals whose lengths tend to zero and at whose 
endpoints f assumes values with opposite signs. In the second case, by the nested 
interval lemma, there exists a unique point c € [a, b] common to all the intervals. 
By construction there are two sequences of endpoints {x/,} and {x/"} of the inter- 
vals I, such that f(x) <0 and f(x!) > 0, while limy 0x), = limpo0.x/) =. 
By the properties of a limit and the definition of continuity, we then find that 
limn—oo f (x/,) = f(©) < 0 and limpoo f (xf) = f(c) = 0. Thus f(c) = 0. 


Remarks to Theorem 2 1° The proof of the theorem provides a very simple algo- 
rithm for finding a root of the equation f(x) = 0 on an interval at whose endpoints 
a continuous function f(x) has values with opposite signs. 

2° Theorem 2 thus asserts that it is impossible to pass continuously from positive 
to negative values without assuming the value zero along the way. 

3° One should be wary of intuitive remarks like Remark 2°, since they usually 
assume more than they state. Consider, for example, the function equal to —1 on 
the closed interval [0,1] and equal to 1 on the closed interval [2, 3]. It is clear 
that this function is continuous on its domain of definition and assumes values with 
opposite signs, yet never assumes the value 0. This remark shows that the property 
of a continuous function expressed by Theorem 2 is actually the result of a certain 
property of the domain of definition (which, as will be made clear below, is the 
property of being connected). 


Corollary to Theorem 2 /f the function g is continuous on an open interval and 
assumes values g(a) = A and ¢(b) = B at points a and b, then for any number C 
between A and B, there is a point c between a and b at which p(c) = C. 


Proof The closed interval J with endpoints a and b lies inside the open interval 
on which ¢ is defined. Therefore the function f(x) = g(x) — C is defined and 
continuous on J. Since f(a)- f(b) = (A — C)(B — C) < 0, Theorem 2 implies 
that there is a point c between a and b at which f(c) = g(c) -C =0. 


Theorem 3 (The Weierstrass maximum-value theorem) A function that is continu- 
ous on a closed interval is bounded on that interval. Moreover there is a point in 


5 We recall that C(£) denotes the set of all continuous functions on the set E. In the case E = [a, b] 
we often write, more briefly, C[a, b] instead of C([a, b]). 
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the interval where the function assumes its maximum value and a point where it 
assumes its minimal value. 


Proof Let f : E — R be a continuous function on the closed interval E = [a, b]. 
By the local properties of a continuous function (see Theorem |) for any point 
x € E there exists a neighborhood U (x) such that the function is bounded on the set 
Ur(x) = EN U(x). The set of such neighborhoods U(x) constructed for all x € E 
forms a covering of the closed interval [a, b] by open intervals. By the finite cover- 
ing lemma, one can extract a finite system U(x;),..., U(x,) of open intervals that 
together cover the closed interval [a, b]. Since the function is bounded on each set 
EQU (xg) = VE(xx), that is, m, < f(x) < Mx, where mg, and M, are real numbers 
and x € Ug (xx), we have 


min{m 1,...,™y,}< f(x) <max{M),..., My} 


at any point x € E = [a, b]. It is now established that f(x) is bounded on [a, b]. 

Now let M = sup, er f(x). Assume that f(x) < M at every point x € E. Then 
the continuous function M — f(x) on E is nowhere zero, although (by the defini- 
tion of M) it assumes values arbitrarily close to 0. It then follows that the function 
wow is, on the one hand, continuous on E because of the local properties of con- 
tinuous functions, but on the other hand not bounded on E, which contradicts what 
has just been proved about a function continuous on a closed interval. 

Thus there must be a point xy € [a, b] at which f(xy) = M. 

Similarly, by considering m = infyeg f(x) and the auxiliary function FG? 
we prove that there exists a point x», € [a, b] at which f(x») =m. 


We remark that, for example, the functions f|(x) = x and fo(x) = + are contin- 
uous on the open interval E = ]0, 1[, but f| has neither a maximal nor a minimal 
value on EF, and f> is unbounded on E. Thus, the properties of a continuous function 
expressed in Theorem 3 involve some property of the domain of definition, namely 
the property that from every covering of E by open intervals one can extract a finite 
subcovering. From now on we shall call such sets compact. 

Before passing to the next theorem, we give a definition. 


Definition 1 A function f : E > R is uniformly continuous on a set E C R if for 
every € > 0 there exists 5 > 0 such that | f (x1) — f(x2)| < ¢ for all points x1,x2 € E 
such that |x; — x2| <6. 


More briefly, 


(f : E > Ris uniformly continuous) := 


= (Ve > 0.35 > 0 Vx) € E Vx2 € E (xj — 91 <5 > | f (x1) — f (2) <e)). 


Let us now discuss the concept of uniform continuity. 
1° If a function is uniformly continuous on a set, it is continuous at each point of 
that set. Indeed, in the definition just given it suffices to set x; =x and x2 =a, and 
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we see that the definition of continuity of a function f : E > R ata pointa € E is 
satisfied. 

2° Generally speaking, the continuity of a function does not imply its uniform 
continuity. 


Example 4 The function f(x) = sin i. which we have encountered many times, is 
continuous on the open interval ]0, 1[ = E. However, in every neighborhood of 0 
in the set E the function assumes both values —1 and 1. Therefore, for ¢ < 2, the 
condition | f (x1) — f(x2)| < e does not hold. 

In this connection it is useful to write out explicitly the negation of the property 
of uniform continuity for a function: 


(f : E > Ris not uniformly continuous) := 


= (de > 0 V6 > 0 Ax, € E 3x2 € E (|x1 — x2] <5 A | f(a) — f@2)| =e). 


This example makes the difference between continuity and uniform continuity 
of a function on a set intuitive. To point out the place in the definition of uniform 
continuity from which this difference proceeds, we give a detailed expression of 
what it means for a function f : E — R to be continuous on E: 


(f : E > Ris continuous on £) := 


= (Va € EVe > 035 > OVx € E (|x —a| <8 > |f (x) — f(@| <s)). 


Thus the number 6 is chosen knowing the point a € E and the number ¢, and so 
for a fixed ¢ the number 6 may vary from one point to another, as happens in the 
case of the function sin + considered in Example |, or in the case of the function 
log, x or a* studied over their full domain of definition. 

In the case of uniform continuity we are guaranteed the possibility of choosing 6 
knowing only ¢ > 0 so that |x — a| < 6 implies | f(x) — f(a)| <e for all x € E and 


ack. 


Example 5 If the function f : E — R is unbounded in every neighborhood of a 
fixed point xo € E, then it is not uniformly continuous. 


Indeed, in that case for any 6 > 0 there are points x; and x2 in every 5. 
neighborhood of xo such that | f (x1) — f(x2)| > 1 although |x; — x2| <6. 

Such is the situation with the function f(x) = i on the set R\O. In this case 
xo = 0. The same situation holds in regard to log, x, which is defined on the set of 
positive numbers and unbounded in a neighborhood of xo = 0. 


Example 6 The function f(x) = x”, which is continuous on R, is not uniformly 
continuous on R. 
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In fact, at the points x) = /n + 1 and x!’ = /n, where n € N, we have f(x/,) = 
n+ land f(x) =n, so that f(x’, y= fe hea. But 


fim (vn 1— fn) = lim ————~ a =0, 


so that for any 6 > 0 there are points x/, and x," such that |xj, — x/’| <6, yet f(x;,) — 


f(x) =1. 


Example 7 The function f(x) = sin(x7), which is continuous and bounded on R, 
is not uniformly continuous on R. Indeed, at the points xj, = ,/5(n + 1) and xj) = 


en, where n € N, we have | f (x/,) — f (x;)| = 1, while limps oo |x), — x/7| = 0. 


After this discussion of the concept of uniform continuity of a function and com- 
parison of continuity and uniform continuity, we can now appreciate the following 
theorem. 


Theorem 4 (The Cantor—Heine theorem on uniform continuity) A function that is 
continuous on a closed interval is uniformly continuous on that interval. 


We note that this theorem is usually called Cantor’s theorem in the literature. To 
avoid unconventional terminology we shall preserve this common name in subse- 
quent references. 


Proof Let f : E > R be a given function, E = [a,b], and f € C(E). Since f is 
continuous at every point x € E, it follows (see 6° in Sect. 4.1.1) that, knowing e > 0 
we can find a 6-neighborhood U 5(x) of x such that the oscillation o(f; us (x)) 
of f on the set ue (x) = EN U%(x), consisting of the points in the domain of 
definition E lying in U°(x), is less than ¢. For each point x € E we construct a 
neighborhood U®(x) having this property. The quantity 6 may vary from one point 
to another, so that it would be more accurate, if more cumbersome, to denote the 
neighborhood by the symbol U°“) (x), but since the whole symbol is determined by 
the point x, we can agree on the following abbreviated notation: U(x) = U 5) (x) 
and V(x) = U®)/2(x), 

The open intervals V(x), x € E, taken together, cover the closed interval 
[a,b], and so by the finite covering lemma one can select a finite covering 
V(x1),---, Vin). Let 5 = min{5(x1), ..., 35(%n)}. We shall show that | f(x’) — 
f(x")| < e for any points x’, x” € E such that |x’ — x”| < 6. Indeed, since the sys- 
tem of open intervals V(x1),..., V(xn) covers E, there exists an interval V (x;) of 
this system that contains x’, that is |x’ — x;| < 55(xi). But in that case 


" F e / £ . 1 ; 1 ; 1 7 ae . 
|x” — xj] < |x’ —x"| + |x’ — 3; | <8 + 58(%) < 58(x1) + 58) = 5G). 
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Consequently x’,x” € US“? (x;) = EN UPD (x;) and so |f(x’) — f(x”) < 
o(f; UL (;)) <e. 


The examples given above show that Cantor’s theorem makes essential use of a 
certain property of the domain of definition of the function. It is clear from the proof 
that, as in Theorem 3, this property is that from every covering of E by neighbor- 
hoods of its points one can extract a finite covering. 

Now that Theorem 4 has been proved, it is useful to return once again to the 
examples studied earlier of functions that are continuous but not uniformly continu- 
ous, in order to clarify how it happens that sin(x?) for example, which is uniformly 
continuous on each closed interval of the real line by Cantor’s theorem, is never- 
theless not uniformly continuous on R. The reason is completely analogous to the 
reason why a continuous function in general fails to be uniformly continuous. This 
time we invite our readers to investigate this question on their own. 

We now pass to the last theorem of this section, the inverse function theorem. 
We need to determine the conditions under which a real-valued function on a closed 
interval has an inverse and the conditions under which the inverse is continuous. 


Proposition 1 A continuous mapping f : E > R of a closed interval E = [a, b] 
into R is injective if and only if the function f is strictly monotonic on [a, b]. 


Proof If f is increasing or decreasing on any set E C R whatsoever, the mapping 
f : E — R is obviously injective: at different points of E the function assumes 
different values. 

Thus the more substantive part of Proposition | consists of the assertion that ev- 
ery continuous injective mapping f : [a, b] > R is realized by a strictly monotonic 
function. 

Assuming that such is not the case, we find three points x1 < x2 < x3 in [a, b] 
such that f (x2) does not lie between f(x;) and f (x3). In that case, either f (x3) lies 
between f(x;) and f(x2) or f(x1) lies between f (x2) and f (x3). For definiteness 
assume that the latter is the case. By hypothesis f is continuous on [x2, x3]. There- 
fore, by Theorem 2, there is a point x} in this interval such that f(x}) = f (x1). 
We then have x; < x}, but f(x1) = f(x}), which is inconsistent with the injectiv- 
ity of the mapping. The case when / (x3) lies between f(x) and f (x2) is handled 
similarly. 


Proposition 2 Each strictly monotonic function f : X — R defined on a numerical 
set X CR has an inverse f—~':Y — R defined on the set Y = f (X) of values of f, 
and has the same kind of monotonicity on Y that fhas on X. 


Proof The mapping f : X > Y = f(X) is surjective, that is, it is a mapping of X 
onto Y. For definiteness assume that f : X — Y is increasing on X. In that case 


Vx, € X Wx2 € X (x1 < 2x2  f (x1) < f(x). (4.1) 
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Thus the mapping f : X — Y assumes different values at different points, and 
so is injective. Consequently f : X — Y is bijective, that is, it is a one-to-one cor- 
respondence between X and Y. Therefore the inverse mapping f~!: Y > X is 
defined by the formula x = f~!(y) when y = f(x). 

Comparing the definition of the mapping f~! : Y > X with relation (4.1), we 
arrive at the relation 


Vy €¥ Vy €Y (for < f7'62) & v1 <y2), (4.2) 


which means that the function f~! is also increasing on its domain of definition. 
The case when f : X — Y is decreasing on X is obviously handled similarly. 


In accordance with Proposition 2 just proved, if we are interested in the conti- 
nuity of the function inverse to a real-valued function, it is useful to investigate the 
continuity of monotonic functions. 


Proposition 3 The discontinuities of a function f : E — R that is monotonic on the 
set E C Ran be only discontinuities of first kind. 


Proof For definiteness let f be nondecreasing. Assume that a € E is a point of 
discontinuity of f. Since a cannot be an isolated point of EF, a must be a limit point 
of at least one of the two sets ET = {x € E|x <a} and Et ={xe E|x >a}. 
Since f is nondecreasing, for any point x €¢ E, we have f(x) < f(a), and the 
restriction f|,— of f to E; is a nondecreasing function that is bounded from above. 
It then follows that the limit 


lim (flg-)@)= , lim f(x) = fa-0) 


Eq axa 


exists. 
The proof that the limit limg5,;+9+0 f(x) = f(a + 0) exists when a is a limit 
point of E+ is analogous. 
The case when / is a nonincreasing function can be handled either by repeating 
the reasoning just given or passing to the function — f, so as to reduce the question 
to the case already considered. 


Corollary 1 [f a is a point of discontinuity of a monotonic function f : E > R, 
then at least one of the limits 


lim f(x) = f(a—9), lim = f(@)=f(a+0) 
E>x—a—0 E>x—a+0 
exists, and strict inequality holds in at least one of the inequalities f(a — 0) < 
f(a) < f(a +0) when f is nondecreasing and f(a —0) => f(a) => f(a+ 0) when 
fis nonincreasing. The function assumes no values in the open interval defined by 
the strict inequality. Open intervals of this kind determined by different points of 
discontinuity have no points in common. 


4.2 Properties of Continuous Functions 165 


Proof Indeed, if a is a point of discontinuity, it must be a limit point of the set E, 
and by Proposition 3 is a discontinuity of first kind. Thus at least one of the bases 
E>x—>a-—Oand E3x—-a-+0 is defined, and the limit of the function over 
that base exists. (When both bases are defined, the limits over both bases exist.) 
For definiteness assume that f is nondecreasing. Since a is a point of discontinuity, 
strict inequality must actually hold in at least one of the inequalities f(a — 0) < 
f(a) < f(a+0). Since f(x) <limgsx+a_0 f(x) = f(a—0), ifx € E andx <a, 
the open interval (f(a — 0), f(a)) defined by the strict inequality f(a — 0) < f(a) 
is indeed devoid of values of the function. Analogously, since f(a +0) < f(x) if 
x € E anda < x, the open interval (f(a), f(a + 0)) defined by the strict inequality 
f(a) < f(a +0) contains no values of f. 

Let a, and a2 be two different points of discontinuity of f, and assume aj < a2. 
Then, since the function is nondecreasing, 


f(a -—9) <= fla) < flat+9) < f@—-9) < fla) < flm@+9). 


It follows from this that the intervals containing no values of f and corresponding 
to different points of discontinuity are disjoint. 


Corollary 2 The set of points of discontinuity of a monotonic function is at most 
countable. 


Proof With each point of discontinuity of a monotonic function we associate the 
corresponding open interval in Corollary 1 containing no values of f. These inter- 
vals are pairwise disjoint. But on the line there cannot be more than a countable 
number of pairwise disjoint open intervals. In fact, one can choose a rational num- 
ber in each of these intervals, so that the collection of intervals is equipollent with 
a subset of the set Q of rational numbers. Hence it is at most countable. Therefore, 
the set of points of discontinuity, which is in one-to-one correspondence with a set 
of such intervals, is also at most countable. 


Proposition 4 (A criterion for continuity of a monotonic function) A monotonic 
function f : E — R defined on a closed interval E = [a,b] is continuous if and 
only if its set of values f (E) is the closed interval with endpoints f(a) and f (b).° 


Proof If f is a continuous monotonic function, the monotonicity implies that all 
the values that f assumes on the closed interval [a, b] lie between the values f(a) 
and f(b) that it assumes at the endpoints. By continuity, the function must assume 
all the values intermediate between f(a) and f(b). Hence the set of values of a 
function that is monotonic and continuous on a closed interval [a, b] is indeed the 
closed interval with endpoints f(a) and f(b). 

Let us now prove the converse. Let f be monotonic on the closed interval [a, b]. 
If f has a discontinuity at some point c € [a, b], by Corollary | one of the open in- 
tervals ] f(c —0), f(c)[ and ] f(c), f(c + 0)[ is defined and nonempty and contains 


Here f (a) < f(b) if f is nondecreasing, and f(b) < f(a) if f is nonincreasing. 
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no values of f. But, since f is monotonic, that interval is contained in the interval 
with endpoints f(a) and f(b). Hence if a monotonic function has a point of dis- 
continuity on the closed interval [a, b], then the closed interval with endpoints f(a) 
and f(b) cannot be contained in the range of values of the function. 


Theorem 5 (The inverse function theorem) A function f : X — R that is strictly 
monotonic ona set X C R hasan inverse f~! : Y — R defined on the set Y = f (X) 
of values of f. The function f~' : Y > R is monotonic and has the same type of 
monotonicity on Y that f has on X. 

If in addition X is a closed interval [a,b] and f is continuous on X, then the 
set Y = f(X) is the closed interval with endpoints f(a) and f(b) and the function 
f7!:Y = R is continuous on it. 


Proof The assertion that the set Y = f(X) is the closed interval with endpoints 
f(a) and f(b) when X = [a,b] and f is continuous follows from Proposition 4 
proved above. It remains to be verified that f~! : Y > R is continuous. But f~! is 
monotonic on Y, Y is a closed interval, and f —l(Y) = X = [a, b] is also a closed 
interval. We conclude by Proposition 4 that f~! is continuous on the interval Y with 
endpoints f(a) and f(b). 


Example § The function y = f(x) = sinx is increasing and continuous on the 
closed interval [— - 5]. Hence the restriction of the function to the closed interval 
[- a 5] has an inverse x = g-* (y), which we denote x = arcsin y; this function is 
defined on the closed interval [sin(—7), sin(})] = [—1, 1], increases from —7 to 
oe and is continuous on this closed interval. 


Example 9 Similarly, the restriction of the function y = cos x to the closed interval 
[0, 2] is a decreasing continuous function, which by Theorem 5 has an inverse de- 
noted x = arccos y, defined on the closed interval [—1, 1] and decreasing from z to 
0 on that interval. 


Example 10 The restriction of the function y = tanx to the open interval X = 
]—4, 51 is a continuous function that increases from —oo to +00. By the first part 
of Theorem 5 it has an inverse denoted x = arctan y, defined for all y € R, and 
increasing within the open interval ]—4, >[ of its values. To prove that the func- 
tion x = arctan y is continuous at each point yo of its domain of definition, we take 
the point x9 = arctan yo and a closed interval [xo — €, x9 + €] containing xo and 
contained in the open interval ]—7, 5[. If xo — € = arctan(yo — 51) and x9 +e = 
arctan(yo + 62), then for every y € R such that yo — 5) < y < yo +62 we shall have 
Xo — € < arctan y < x9 + €. Hence | arctan y — arctan yo| < ¢ for —d; < y— yo < dp. 
The former inequality holds in particular if |y — yo| < 6 = min{6,, 62}, which veri- 
fies that the function x = arctan y is continuous at the point yo € R. 


Example 11 By reasoning analogous to that of the preceding example, we establish 
that since the restriction of the function y = cotx to the open interval ]0, z[ is a 
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continuous function that decreases from -+-oo to —oo, it has an inverse denoted 
x = arccot y, defined, continuous, and decreasing on the entire real line R from z 
to 0 and assuming values in the range ]0, zr[. 


Remark In constructing the graphs of mutually inverse functions y = f(x) and x = 
f—'!() it is useful to keep in mind that in a given coordinate system the points with 
coordinates (x, f(x)) = (x, y) and (y, #7") = (y, x) are symmetric with respect 
to the bisector of the angle in the first quadrant. 

Thus the graphs of mutually inverse functions, when drawn in the same coordi- 
nate system, are symmetric with respect to this angle bisector. 


4.2.3 Problems and Exercises 


1. Show that 


a) if f © C(A) and B CA, then f|p € C(B); 

b) if a function f : Ej U Ey > R is such that f|z, ¢ C(£;), i = 1, 2, it is not 
always the case that f € C(E) U E2). 

c) the Riemann function R, and its restriction 7e|g to the set of rational numbers 
are both discontinuous at each point of Q except 0, and all the points of discontinuity 
are removable (see Example 12 of Sect. 4.1). 


2. Show that for a function f € C[a, b] the functions 
m(x)= min f(t) and M(x)= max f(t) 
a<t<x a<t<x 


are also continuous on the closed interval [a, b]. 
3. a) Prove that the function inverse to a function that is monotonic on an open 
interval is continuous on its domain of definition. 

b) Construct a monotonic function with a countable set of discontinuities. 

c) Show that if functions f : X > Y and f~!: Y + X are mutually inverse 
(here X and Y are subsets of R), and f is continuous at a point x9 € X, the function 
f—! need not be continuous at yo = f (xo) in Y. 


4. Show that 


a) if f € C[a, b] and g € C[a, db], and, in addition, f(a) < g(a) and f(b) > 
g(b), then there exists a point c € [a, b] at which f(c) = g(c); 

b) any continuous mapping f : [0, 1] — [0, 1] of a closed interval into itself has 
a fixed point, that is, a point x € [0, 1] such that f(x) =x; 

c) if two continuous mappings f and g of an interval into itself commute, that 
is, fog=go f, then they do not always have a common fixed point; 

d) acontinuous mapping f : R— R may fail to have a fixed point; 

e) acontinuous mapping f : JO, 1[ — ]O, 1[ may fail to have a fixed point; 

f) if a mapping f : (0, 1] — [0, 1] is continuous, f(0) =0, fC) = 1, and (f o 
f)(@) =x on [0, 1], then f(x) =x. 
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5. Show that the set of values of any function that is continuous on a closed interval 
is a closed interval. 
6. Prove the following statements. 


a) If a mapping f : [0,1] — [0,1] is continuous, f(0) = 0, fC) = 1, and 

f" (x) := fo---o f(x) =x on [0, 1], then f(x) =x. 
——— 
n factors 

b) If a function f : [0,1] — [0,1] is continuous and nondecreasing, then for 
any point x € [0, 1] at least one of the following situations must occur: either x is a 
fixed point, or f” (x) tends to a fixed point. (Here f” (x) = f o---o f(x) is the nth 
iteration of f.) 


7. Let f : [0,1] ~ R be a continuous function such that f(0) = f (1). Show that 


a) for any n € N there exists a horizontal closed interval of length ‘ with end- 
points on the graph of this function; 

b) if the number / is not of the form i there exists a function of this form on 
whose graph one cannot inscribe a horizontal chord of length /. 


8. The modulus of continuity of a function f : E > R is the function w(6) defined 
for 5 > 0 as follows: 


o(5)= sup |f(x1)— f(x). 


|xy—x2|<6d 
x1,X2€E 
Thus, the least upper bound is taken over all pairs of points x1,x2 of E whose 
distance apart is less than 6. 
Show that 


a) the modulus of continuity is a nondecreasing nonnegative function having the 
limit? @(+0) = lim;_, +9 @(8); 

b) for every ¢ > O there exists 56 > 0 such that for any points x;,x2 € E the 
relation |x; — x2| < 6 implies | f(x1) — f(x2)| < @(40) +6; 

c) if E is aclosed interval, an open interval, or a half-open interval, the relation 


w(5, + 62) < w(61) + @(52) 


holds for the modulus of continuity of a function f : E > R; 

d) the moduli of continuity of the functions x and sin(x) on the whole real axis 
are respectively w(é6) = 6 and the constant w(5) = 2 in the domain 6 > 0; 

e) afunction f is uniformly continuous on E if and only if w(+0) = 0. 


9. Let f and g be bounded functions defined on the same set X. The quantity 
A=sup,ey | f(x) — g(x)| is called the distance between f and g. It shows how well 
one function approximates the other on the given set X. Let X be a closed interval 
[a, b]. Show that if f, g € C[a, b], then 4x0 € [a, b], where A = | f (xo) — g(xo)|, 
and that such is not the case in general for arbitrary bounded functions. 


7For this reason the modulus of continuity is usually considered for 5 > 0, setting w(0) = w(+0). 
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10. Let P,, (x) be a polynomial of degree n. We are going to approximate a bounded 
function f : [a,b] > R by polynomials. Let 


A(P,)= sup | f(x) — P,(x)| and En(f) = inf A(Pr), 


x€[a,b] 


where the infimum is taken over all polynomials of degree n. A polynomial P,, is 
called a polynomial of best approximation of f if A(P,) = En(f). 
Show that 


a) there exists a polynomial Po(x) = ao of best approximation of degree zero; 
b) among the polynomials Q, (x) of the form 2 P,,(x), where P,, is a fixed poly- 
nomial, there is a polynomial Q;,, such that 


A(Qig) = min A(Q,); 


c) if there exists a polynomial of best approximation of degree n, there also 
exists a polynomial of best approximation of degree n + 1; 

d) for any bounded function on a closed interval and any n = 0, 1, 2,... there 
exists a polynomial of best approximation of degree n. 


11. Prove the following statements. 


a) A polynomial of odd degree with real coefficients has at least one real root. 

b) If P, is a polynomial of degree n, the function sgn P, (x) has at most n points 
of discontinuity. 

c) If there are n + 2 points x9 < x; < +--+ < X,+41 in the closed interval [a, b] 
such that the quantity 


sgn[ (f(x) — Pn(xi))(-1)'] 


assumes the same value fori = 0,...,n + 1, then E,(f) > ming<j<n+1 | f (xi) — 
P,,(x;)|. (This result is known as Vallée Poussin’s theorem.® For the definition of 
E,(f) see Problem 10.) 


12. a) Show that for any n € N the function T,,(x) = cos(narccosx) defined on 
the closed interval [—1, 1] is an algebraic polynomial of degree n. (These are the 
Chebyshev polynomials.) 

b) Find an explicit algebraic expression for the polynomials 7;, T>, 73, and T 
and draw their graphs. 

c) Find the roots of the polynomial T,,(x) on the closed interval [—1, 1] and 
the points of the interval where |7;,(x)| assumes its maximum value. 

d) Show that among all polynomials P,,(x) of degree n whose leading coef- 
ficient is 1 the polynomial 7, (x) is the unique polynomial closest to zero, that is, 
E,,(O) = max)x|<1 |Z (x)|. (For the definition of E,(f) see Problem 10.) 


8Ch.J. de la Vallée Poussin (1866-1962) — Belgian mathematician and specialist in theoretical 
mechanics. 
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13. Let f € Cla, bd]. 


a) Show that if the polynomial P, (x) of degree n is such that there are n + 2 
points x9 <--- < Xn41 (called Chebyshev alternant points) for which f(x;) — 
P,(x;) = (—1)!'A(P,) - a, where A(P,) = maxxe[a,b] | f(x) — Pn(x)| and @ is a 
constant equal to | or —1, then P,,(x) is the unique polynomial of best approxima- 
tion of degree n to f (see Problem 10). 

b) Prove Chebyshev’s theorem: A polynomial P,(x) of degree n is a polynomial 
of best approximation to the function f € C[a, b] if and only if there are at least 
n+ 2 Chebyshev alternant points on the closed interval [a, b]. 

c) Show that for discontinuous functions the preceding statement is in general 
not true. 

d) Find the polynomials of best approximation of degrees zero and one for the 
function |x| on the interval [—1, 2]. 


14. In Sect. 4.2 we discussed the local properties of continuous functions. The 
present problem makes the concept of a local property more precise. 

Two functions f and g are considered equivalent if there is a neighborhood U (a) 
of a given point a € R such that f(x) = g(x) for all x € U(a). This relation between 
functions is obviously reflexive, symmetric, and transitive, that is, it really is an 
equivalence relation. 

A class of functions that are all equivalent to one another at a point a is called 
a germ of functions at a. If we consider only continuous functions, we speak of a 
germ of continuous functions at a. 

The local properties of functions are properties of the germs of functions. 


a) Define the arithmetic operations on germs of numerical-valued functions de- 
fined at a given point. 

b) Show that the arithmetic operations on germs of continuous functions do not 
lead outside this class of germs. 

c) Taking account of a) and b), show that the germs of continuous functions 
form a ring — the ring of germs of continuous functions. 

d) Asubring J of aring K is called an ideal of K if the product of every element 
of the ring K with an element of the subring J belongs to /. Find an ideal in the 
ring of germs of continuous functions at a. 


15. An ideal in a ring is maximal if it is not contained in any larger ideal except 
the ring itself. The set C[a, b] of functions continuous on a closed interval forms a 
ring under the usual operations of addition and multiplication of numerical-valued 
functions. Find the maximal ideals of this ring. 


Chapter 5 
Differential Calculus 


5.1 Differentiable Functions 
5.1.1 Statement of the Problem and Introductory Considerations 


Suppose, following Newton,! we wish to solve the Kepler problem? of two bodies, 
that is, we wish to explain the law of motion of one celestial body m (a planet) rela- 
tive to another body M (a star). We take a Cartesian coordinate system in the plane 
of motion with origin at M (Fig. 5.1). Then the position of m at time t can be char- 
acterized numerically by the coordinates (x(t), y(¢)) of the point in that coordinate 
system. We wish to find the functions x(t) and y(f). 

The motion of m relative to M is governed by Newton’s two famous laws: the 
general law of motion 


ma=F, (5.1) 


connecting the force vector with the acceleration vector that it produces via the 
coefficient of proportionality m — the inertial mass of the body,’ and the law of 
universal gravitation, which makes it possible to find the gravitational action of the 
bodies m and M on each other according to the formula 


mM 


F=G—,r, 
Ir| 


(5.2) 


1T. Newton (1642-1727) — British physicist, astronomer, and mathematician, an outstanding 
scholar, who stated the basic laws of classical mechanics, discovered the law of universal grav- 
itation, and developed (along with Leibniz) the foundations of differential and integral calculus. 
He was appreciated even by his contemporaries, who inscribed on his tombstone: “Hic depositum 
est, quod mortale fuit Isaaci Newtoni” (here lies what was mortal of Isaac Newton). 


?J. Kepler (1571-1630) — famous German astronomer who discovered the laws of motion of the 
planets (Kepler’s laws). 
3We have denoted the mass by the same symbol we used for the body itself, but this will not lead 


to any confusion. We remark also that if m < M, the coordinate system chosen can be considered 
inertial. 
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Fig. 5.1 


where r is a vector with its initial point in the body to which the force is applied and 
its terminal point in the other body and |r| is the length of the vector r, that is, the 
distance between m and M. 

Knowing the masses m and M, we can easily use Eq. (5.2) to express the right- 
hand side of Eq. (5.1) in terms of the coordinates x(t) and y(t) of the body m at 
time ¢, and thereby take account of all the data for the given motion. 

To obtain the relations on x(t) and y(t) contained in Eq. (5.1), we must learn 
how to express the left-hand side of Eq. (5.1) in terms of x(t) and y(t). 

Acceleration is a characteristic of a change in velocity v(t). More precisely, it is 
simply the rate at which the velocity changes. Therefore, to solve the problem we 
must first of all learn how to compute the velocity v(t) at time ¢ possessed by a body 
whose motion is described by the radius-vector r(t) = (x(t), y(¢)). 

Thus we wish to define and learn how to compute the instantaneous velocity of a 
body that is implicit in the law of motion (5.1). 

To measure a thing is to compare it to a standard. In the present case, what can 
serve as a standard for determining the instantaneous velocity of motion? 

The simplest kind of motion is that of a free body moving under inertia. This is 
a motion under which equal displacements of the body in space (as vectors) occur 
in equal intervals of time. It is the so-called uniform (rectilinear) motion. If a point 
is moving uniformly, and r(O) and r(1) are its radius-vectors relative to an inertial 
coordinate system at times t = 0 and t = | respectively, then at any time ¢ we shall 
have 


r(t) —r(O) =v-t, (5.3) 


where v = r(1) — r(0). Thus the displacement r(t) — r(O) turns out to be a linear 
function of time in this simplest case, where the role of the constant of proportional- 
ity between the displacement r(t) — r(O) and the time f is played by the vector v that 
is the displacement in unit time. It is this vector that we call the velocity of uniform 
motion. The fact that the motion is rectilinear can be seen from the parametric rep- 
resentation of the trajectory: r(t) = r(0) +. v- t, which is the equation of a straight 
line, as you will recall from analytic geometry. 

We thus know the velocity v of uniform rectilinear motion given by Eq. (5.3). By 
the law of inertia, if no external forces are acting on a body, it moves uniformly in a 
straight line. Hence if the action of M on m were to cease at time f, the latter would 
continue its motion, in a straight line at a certain velocity from that time on. It is 
natural to regard that velocity as the instantaneous velocity of the body at time f. 
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However, such a definition of instantaneous velocity would remain a pure ab- 
straction, giving us no guidance for explicit computation of the quantity, if not for 
the circumstance of primary importance that we are about to discuss. 

While remaining within the circle we have entered (logicians would call it a “vi- 
cious” circle) when we wrote down the equation of motion (5.1) and then undertook 
to determine what is meant by instantaneous velocity and acceleration, we never- 
theless remark that, even with the most general ideas about these concepts, one can 
draw the following heuristic conclusions from Eq. (5.1). If there is no force, that is, 
F = 0, then the acceleration is also zero. But if the rate of change a(t) of the velocity 
v(t) is zero, then the velocity v(t) itself must not vary over time. In that way, we 
arrive at the law of inertia, according to which the body indeed moves in space with 
a velocity that is constant in time. 

From this same Eq. (5.1) we can see that forces of bounded magnitude are capa- 
ble of creating only accelerations of bounded magnitude. But if the absolute magni- 
tude of the rate of change of a quantity P(t) over a time interval [0, t] does not ex- 
ceed some constant c, then, in our picture of the situation, the change | P(t) — P(0)| 
in the quantity P over time t cannot exceed c -f, that is, in this situation, the quantity 
changes by very little in a small interval of time. (In any case, the function P(t) turns 
out to be continuous.) Thus, in a real mechanical system the parameters change by 
small amounts over a small time interval. 

In particular, at all times t close to some time to the velocity v(t) of the body 
m must be close to the value v(tg) that we wish to determine. But in that case, in 
a small neighborhood of the time to the motion itself must differ by only a small 
amount from uniform motion at velocity V(to), and the closer to fo, the less it differs. 

If we photographed the trajectory of the body m through a telescope, depend- 
ing on the power of the telescope, we would see approximately what is shown in 
Fig. 5.2. 

The portion of the trajectory shown in Fig. 5.2c corresponds to a time interval so 
small that it is difficult to distinguish the actual trajectory from a straight line, since 
this portion of the trajectory really does resemble a straight line, and the motion 
resembles uniform rectilinear motion. From this observation, as it happens, we can 
conclude that by solving the problem of determining the instantaneous velocity (ve- 
locity being a vector quantity) we will at the same time solve the purely geometric 
problem of defining and finding the tangent to a curve (in the present case the curve 
is the trajectory of motion). 

Thus we have observed that in this problem we must have v(t) © v(¢o) for t 
close to fo, that is, v(t) > v(to) as t > fg, or, what is the same, v(t) = v(to) + o(1) 
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as t > fo. Then we must also have 
r(t) — r(to) © V(to) - (¢ — to) 


for t close to fg. More precisely, the value of the displacement r(t) — r(¢p) is equiv- 
alent to v(to)(t — to) as t > fg, or 


r(t) — r(to) = v(to)(t — to) + 0(v(to) (t — t0)), (5.4) 


where o(v(tg)(t — fo)) is a correction vector whose magnitude tends to zero faster 
than the magnitude of the vector v(to)(t — to) as t > to. Here, naturally, we must 
except the case when v(tg) = 0. So as not to exclude this case from consideration in 
general, it is useful to observe that* |v(to) (t — to) | = |V(to)||t — to|. Thus, if |v(to)| 4 
0, then the quantity |v(to)(t — fo)| is of the same order as |t — fo|, and therefore 
o(Vv(to)(t — to)) = o(t — to). Hence, instead of (5.4) we can write the relation 


r(t) — r(to) = V(to)(t — fo) + o(t — to), (5.5) 


which does not exclude the case v(fp) = 0. 

Thus, starting from the most general, and perhaps vague ideas about velocity, we 
have arrived at Eq. (5.5), which the velocity must satisfy. But the quantity v(to) can 
be found unambiguously from Eq. (5.5): 


Ope (5.6) 
i> = tt — ft 
Therefore both the fundamental relation (5.5) and the relation (5.6) equivalent to it 
can now be taken as the definition of the quantity v(fo), the instantaneous velocity 
of the body at time fo. 

At this point we shall not allow ourselves to be distracted into a detailed discus- 
sion of the problem of the limit of a vector-valued function. Instead, we shall confine 
ourselves to reducing it to the case of the limit of a real-valued function, which has 
already been discussed in complete detail. Since the vector r(t) — r(fo) has coor- 
dinates (x(t) — x(to), y(t) — y(to)), we have =F) = (xQ— x0) =r) and 
hence, if we regard vectors as being close together if their coordinates are close 
together, the limit in (5.6) should be interpreted as follows: 


r(t) —r(fo) _ (im x(t) — x(t) ‘mn y(t) — “) 


, li 
t> to t to t>10 t to 


V(to) = lim 
i> ~=6t —t0 


and the term o(t — fo) in (5.5) should be interpreted as a vector depending on ¢ such 
that the vector mtg) tends (coordinatewise) to zero as t > f0. 


*Here |t — fo| is the absolute value of the number t — fo while |v| is the absolute value, or length 
of the vector v. 
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Finally, we remark that if v(to) 4 0, then the equation 
r—r(to) = V(to) - (t — to) (5.7) 


defines a line, which by the circumstances indicated above should be regarded as 
the tangent to the trajectory at the point (x (fo), y(to)). 

Thus, the standard for defining the velocity of a motion is the velocity of uniform 
rectilinear motion defined by the linear relation (5.7). The standard motion (5.7) is 
connected with the motion being studied as shown by relation (5.5). The value v(to) 
at which (5.5) holds can be found by passing to the limit in (5.6) and is called the 
velocity of motion at time to. The motions studied in classical mechanics, which are 
described by the law (5.1), must admit comparison with this standard, that is, they 
must admit of the linear approximation indicated in (5.5). 

If r(t) = (x(t), y(t)) is the radius-vector of a moving point m at time f¢, then 
r(t) = (x(t), y(t)) = v(t) is the vector that gives the rate of change of r(t) at time f, 
and r(t) = (x(t), y(t) = a(t) is the vector that gives the rate of change of v(t) 
(acceleration) at time ¢, then Eq. (5.1) can be written in the form 


m-¥(t) = F(t), 


from which we obtain in coordinate form for motion in a gravitational field 


X(t) = —GM -5 @ 32> 
[x-(@)+y7(1)] (5.8) 


3(1) = GM 


(t) 
PLO SETIOTLEE 


This is a precise mathematical expression of our original problem. Since we 
know how to find r(t) from r(t) and then how to find r(t), we are already in a 
position to answer the question whether a pair of functions (x(t), y(t)) can describe 
the motion of the body m about the body M. To answer this question, one must find 
X(t) and y(t) and check whether Eqs. (5.8) hold. The system (5.8) is an example of 
a system of so-called differential equations. At this point we can only check whether 
a set of functions is a solution of the system. How to find the solution or, better ex- 
pressed, how to investigate the properties of solutions of differential equations, is 
studied in a special and, as one can now appreciate, critical area of analysis — the 
theory of differential equations. 

The operation of finding the rate of change of a vector quantity, as has been 
shown, reduces to finding the rates of change of several numerical-valued functions 
— the coordinates of the vector. Thus we must first of all learn how to carry out 
this operation easily in the simplest case of real-valued functions of a real-valued 
argument, which we now take up. 


5.1.2 Functions Differentiable at a Point 


We begin with two preliminary definitions that we shall shortly make precise. 


176 5 Differential Calculus 


Definition 0; A function f : E — R defined on a set E C R is differentiable at a 
point a € E that is a limit point of E if there exists a linear function A - (x — a) of 
the increment x — a of the argument such that f(x) — f(a) can be represented as 


fa) -f@M=A-(-a)+o-a) asx>a,xeE. (5.9) 


In other words, a function is differentiable at a point a if the change in its values 
in a neighborhood of the point in question is linear up to a correction that is infinites- 
imal compared with the magnitude of the displacement x — a from the point a. 


Remark As atrule we have to deal with functions defined in an entire neighborhood 
of the point in question, not merely on a subset of the neighborhood. 


Definition 02 The linear function A - (x — a) in Eq. (5.9) is called the differential 
of the function f at a. 

The differential of a function at a point is uniquely determined; for it follows 
from (5.9) that 


lim LOD=f@ _ lim (442) =4, 
E>x—>a x—-a E>x—-a x—-a 

so that the number A is unambiguously determined due to the uniqueness of the 

limit. 

Definition 1 The number 


f'@ = Jim f£&)— f@) (5.10) 


xa X—-a 


is called the derivative of the function f at a. 


Relation (5.10) can be rewritten in the equivalent form 


f(x) — f@ 


X—a 


= f'(a)+ a(x), 


where a(x) > 0 as x > a,x € E, which in turn is equivalent to 
f@~)-f@= f'@(«—-a)+o(x-a) asx>a,xeE. (5.11) 


Thus, differentiability of a function at a point is equivalent to the existence of its 
derivative at the same point. 

If we compare these definitions with what was said in Sect. 5.1.1, we can con- 
clude that the derivative characterizes the rate of change of a function at the point 
under consideration, while the differential provides the best linear approximation to 
the increment of the function in a neighborhood of the same point. 

If a function f : E > R is differentiable at different points of the set EF, then in 
passing from one point to another both the quantity A and the function o(x — a) in 
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Eq. (5.9) may change (a result at which we have already arrived explicitly in (5.11)). 
This circumstance should be noted in the very definition of a differentiable function, 
and we now write out this fundamental definition in full. 


Definition 2 A function f : E — R defined on a set E C R is differentiable at a 
point x € E that is a limit point of F if 


fath)— fax) =AWh+a(x;h), (5.12) 


where h +> A(x)h is a linear function in h and a(x; h) =o(h) ash>0,x+heE. 


The quantities 
Ax(h) :=(x+h)-—x=h 
and 
Af (x; h):= ft+h)— f(x) 


are called respectively the increment of the argument and the increment of the func- 
tion (corresponding to this increment in the argument). 

They are often denoted (not quite legitimately, to be sure) by the symbols Ax 
and Af (x) representing functions of h. 

Thus, a function is differentiable at a point if its increment at that point, regarded 
as a function of the increment h in its argument, is linear up to a correction that is 
infinitesimal compared to h as h > 0. 


Definition 3 The function h +> A(x)hA of Definition 2, which is linear in h, is called 
the differential of the function f : E — R at the point x € E and is denoted df (x) 
or Df (x). 


Thus, df (x)(h) = A(x)h. 
From Definitions 2 and 3 we have 


Af (x; h) — df (x)(h) = a(x; h), 


and a(x; h) = o(h) ash > 0, x +h e E;; that is, the difference between the incre- 
ment of the function due to the increment / in its argument and the value of the 
function df (x), which is linear in h, at the same h, is an infinitesimal of higher 
order than the first in h. 

For that reason, we say that the differential is the (principal) linear part of the 
increment of the function. 

As follows from relation (5.12) and Definition 1, 


f@ +h) — f@) 


A(x) = f’(x) = A hy 


x+h,xeE 
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and so the differential can be written as 
df (x)(h) = f'(x)h. (5.13) 
In particular, if f(x) =x, we obviously have f’(x) = 1 and 
dx(h)=1-h=h, 


so that it is sometimes said that “the differential of an independent variable equals 
its increment”. 
Taking this equality into account, we deduce from (5.13) that 


df (x)(h) = f'(x) dx(h), (5.14) 
that is, 
df (x) = f'(x) dx. (5.15) 


The equality (5.15) should be understood as the equality of two functions of h. 
From (5.14) we obtain 


df (x)(h) 


dx(h) =f (x), (5.16) 


that is, the function of @) (the ratio of the functions df (x) and dx) is constant and 
equals f’(x). For this reason, following Leibniz, we frequently denote the derivative 
by the symbol df) , alongside the notation f’(x) proposed by Lagrange.> 

In mechanics, in addition to these symbols, the symbol g(t) (read “phi-dot of t”’) 


is also used to denote the derivative of the function g(t) with respect to time f. 


5.1.3 The Tangent Line; Geometric Meaning of the Derivative 
and Differential 


Let f : E > R bea function defined ona set E C R and xo a given limit point of E. 
We wish to choose the constant co so as to give the best possible description of the 
behavior of the function in a neighborhood of the point x9 among constant functions. 
More precisely, we want the difference f(x) — co to be infinitesimal compared with 
any nonzero constant as x > xo, x € E, that is 


f@)=cotoU) asx>x0,x EE. (5.17) 


This last relation is equivalent to saying limes; x, f(x) = co. If, in particular, 
the function is continuous at xo, then limgsx+x) f(x) = f (xo), and naturally co = 


f (xo). 


5J.L. Lagrange (1736-1831) — famous French mathematician and specialist in theoretical mechan- 
ics. 
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Now let us try to choose the function cp + c; (x — x9) so as to have 
f(x) =coteci(x —x9) +o(x— x0) asx—>x0,x EE. (5.18) 


This is obviously a generalization of the preceding problem, since the formula (5.17) 
can be rewritten as 


f(x) = co + 0((x — x0)°) asx—>xo,x EE. 


It follows immediately from (5.18) that co = limgsx—+x,) f(x), and if the function 
is continuous at this point, then co = f (xo). 
If co has been found, it then follows from (5.18) that 


f(x) — co 
c= lim ——. 
E>x>x X—X0 
And, in general, if we were seeking a polynomial P,,(xo; x) = co + c1(% — x0) + 


-++ + ¢,(x — x9)” such that 


f(x) =coter(x — x0) +++ + en (x — x0)" + o((x — xo)") asx —>x0,x ce E, 
(5.19) 
we would find successively, with no ambiguity, that 


co=_lim f(x), 
E>x—>xg 
_  f(x)— co 
c= _lim ——, 


E>x>x9 X— Xo 


f(x) — [eo + +++ + en—1(@ — x0)"7"] 
E>x—>xo (x — xo)” 


’ 


assuming that all these limits exist. Otherwise condition (5.19) cannot be fulfilled, 
and the problem has no solution. 

If the function f is continuous at xo, it follows from (5.18), as already pointed 
out, that co = f (xo), and we then arrive at the relation 


f(x) — f(%o) =c1(x — x9) +o(x— x9) asx x0,x EE, 


which is equivalent to the condition that f(x) be differentiable at xo. 
From this we find 


a f@) = f G0) 
cq = lim M——~— 


E5x—>xo xX — XO 


= f' (x0). 
We have thus proved the following proposition. 
Proposition 1 A function f : E — R that is continuous at a point xo € E that is 


a limit point of E C R admits a linear approximation (5.18) if and only if it is 
differentiable at the point. 
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y 
y = f(z) 
y~ f(a) = Leet) = 9) (g 29) 
y — f (xo) = f'(xo)( — 20) 
f(@o +h) p}-----------------3 
eee ee Af (xo; h) 
f(@0)$-------5, ' 
[ @ 
0 Xo Lot+th x 
Fig. 5.3 
The function 
g(x) =co +. c1(* — Xo) (5.20) 
with co = f (xo) and cy = f’ (xo) is the only function of the form (5.20) that satisfies 
(5.18). 
Thus the function 
g(x) = f (xo) + f’ (x0) (x — x0) (5.21) 


provides the best linear approximation to the function f in a neighborhood of xo in 
the sense that for any other function g(x) of the form (5.20) we have f(x) — g(x) # 
o(x — x9) asx > x0, x EE. 

The graph of the function (5.21) is the straight line 


y — f (xo) = f' (x0) (x — x0), (5.22) 


passing through the point (xo, f(xo)) and having slope f’(xo). 

Since the line (5.22) provides the optimal linear approximation of the graph of 
the function y = f(x) in a neighborhood of the point (xo, f(xo)), it is natural to 
make the following definition. 


Definition 4 If a function f : E — R is defined on a set E C R and differentiable 
at a point xo € E, the line defined by Eq. (5.22) is called the tangent to the graph of 
this function at the point (xo, f (xo)). 

Figure 5.3 illustrates all the basic concepts we have so far introduced in connec- 
tion with differentiability of a function at a point: the increment of the argument, 
the increment of the function corresponding to it, and the value of the differential. 
The figure shows the graph of the function, the tangent to the graph at the point 
Po = (xo, f (xo)), and for comparison, an arbitrary line (usually called a secant) 
passing through Pp and some point P ¥ Pp of the graph of the function. 

The following definition extends Definition 4. 
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Definition 5 If the mappings f : E — R and g: E > R are continuous at a point 
xo € E that is a limit point of E and f(x) — g(x) =o((x —x0)") asx > x0, x € E, 
we Say that f and g have nth order contact at xo (more precisely, contact of order 
at least n). 

For n = | we say that the mappings f and g are tangent to each other at xo. 


According to Definition 5 the mapping (5.21) is tangent at x9 to a mapping f : 
E = R that is differentiable at that point. 

We can now also say that the polynomial P, (x9; x) = co + c1(x — X90) +--+ + 
Cy(x — x9)” of relation (5.19) has contact of order at least n with the function /. 

The number h = x — xo, that is, the increment of the argument, can be regarded 
as a vector attached to the point xo and defining the transition from xo to x = x9 +h. 
We denote the set of all such vectors by TR(xo) or TRy, > Similarly, we denote by 
TR(yo) or TRy, the set of all displacement vectors from the point yo along the y- 
axis (see Fig. 5.3). It can then be seen from the definition of the differential that the 
mapping 

df (xo): TR) > TR(f (x0), (5.23) 


defined by the differential ht» f'(xo)h = df (xo)(A) is tangent to the mapping 
hr> f(xo +h) — fo) = Af Qo; h), (5.24) 


defined by the increment of a differentiable function. 

We remark (see Fig. 5.3) that if the mapping (5.24) is the increment of the ordi- 
nate of the graph of the function y = f(x) as the argument passes from xo to x9 +h, 
then the differential (5.23) gives the increment in the ordinate of the tangent to the 
graph of the function for the same increment / in the argument. 


5.1.4 The Role of the Coordinate System 


The analytic definition of a tangent (Definition 4) may be the cause of some vague 
uneasiness. We shall try to state what it is exactly that makes one uneasy. However, 
we shall first point out a more geometric construction of the tangent to a curve at 
one of its points Pp (see Fig. 5.3). 

Take an arbitrary point P of the curve different from Po. The line determined 
by the pair of points Po and P, as already noted, is called a secant in relation to 
the curve. We now force the point P to approach Po along the curve. If the secant 
tends to some limiting position as we do so, that limiting position of the secant is 
the tangent to the curve at Po. 

Despite its intuitive nature, such a definition of the tangent is not available to us 
at the moment, since we do not know what a curve is, what it means to say that 


©This is a slight deviation from the more common notation 7) R or Ty) (R). 
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“a point tends to another point along a curve”, and finally, in what sense we are to 
interpret the phrase “limiting position of the secant”. 

Rather than make all these concepts precise, we point out a fundamental differ- 
ence between the two definitions of tangent that we have introduced. The second 
was purely geometric, unconnected (at least until it is made more precise) with 
any coordinate system. In the first case, however, we have defined the tangent to 
a curve that is the graph of a differentiable function in some coordinate system. 
The question naturally arises whether, if the curve is written in a different coordi- 
nate system, it might not cease to be differentiable, or might be differentiable but 
yield a different line as tangent when the computations are carried out in the new 
coordinates. 

This question of invariance, that is, independence of the coordinate system, al- 
ways arises when a concept is introduced using a coordinate system. The ques- 
tion applies in equal measure to the concept of velocity, which we discussed in 
Sect. 5.1.1 and which, as we have mentioned already, includes the concept of a 
tangent. 

Points, vectors, lines, and so forth have different numerical characteristics in dif- 
ferent coordinate systems (coordinates of a point, coordinates of a vector, equation 
of a line). However, knowing the formulas that connect two coordinate systems, one 
can always determine from two numerical representations of the same type whether 
or not they are expressions for the same geometric object in different coordinate 
systems. Intuition suggests that the procedure for defining velocity described in 
Sect. 5.1.1 leads to the same vector independently of the coordinate system in which 
the computations are carried out. At the appropriate time in the study of functions 
of several variables we shall give a detailed discussion of questions of this sort. The 
invariance of the definition of velocity with respect to different coordinate systems 
will be verified in the next section. 

Before passing to the study of specific examples, we now summarize some of the 
results. 

We have encountered the problem of the describing mathematically the instanta- 
neous velocity of a moving body. 

This problem led us to the problem of approximating a given function in the 
neighborhood of a given point by a linear function, which on the geometric level led 
to the concept of the tangent. Functions describing the motion of a real mechanical 
system are assumed to admit such a linear approximation. 

In this way we have distinguished the class of differentiable functions in the class 
of all functions. 

The concept of the differential of a function at a point has been introduced. The 
differential is a linear mapping defined on displacements from the point under con- 
sideration that describes the behavior of the increment of a differentiable function 
in a neighborhood of the point, up to a quantity that is infinitesimal in comparison 
with the displacement. 

The differential df (xo)h = f’(xo)h is completely determined by the number 
f' (xo), the derivative of the function f at xg, which can be found by taking the 
limit 
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Pio dts OECD). 


E>x—>xo x — Xo 


The physical meaning of the derivative is the rate of change of the quantity f(x) 
at time xg; its geometrical meaning is the slope of the tangent to the graph of the 
function y = f (x) at the point (xo, f(xo)). 


5.1.5 Some Examples 


Example I Let f(x) =sinx. We shall show that f’(x) = cos x. 


Proof 


_ sine +h)—sinx |, 2sin($) cos(x + 4) 
lim = lim = 
h>0 h h>0 h 


h\ .._ sin) 
= lim cos{x+ = ]- lim =cosx. 
h->0 2 


Here we have used the theorem on the limit of a product, the continuity of the 
function cos x, the equivalence sint ~ ¢ as t — 0, and the theorem on the limit of a 
composite function. 


Example 2 We shall show that cos’ x = — sinx. 


Proof 


cos(x + h) — cosx i —2 sin(4) sin(x + a) 
= lim = 
h>0 h h>0 h 


- 7h 
h sin(> 

= — lim sin{ x + — )- lim () 
h—>0 2 4 


Example 3 We shall show that if f(t) =rcosart, then f’(t) = —rqmsinat. 


Proof 
. reosw(t+h)—rcosat : —2sin($") sina + 4) 
lim =r lim = 
h>0 h h>0 h 
ee h |. sin(#) 
= —ro lim sinw(t+ —)- lim — 
h>0 2° h-0 (St) 


=-rosinoat. 


Example 4 If f(t) =rsinot, then f’(t)=racosat. 
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Proof The proof is analogous to that of Examples | and 3. 


Example 5 (The instantaneous velocity and instantaneous acceleration of a point 
mass) Suppose a point mass is moving in a plane and that in some given coordinate 
system its motion is described by differentiable functions of time 


x=x(t), y=y(t) 
or, what is the same, by a vector 
r(t) = (x(t), y). 
As we have explained in Sect. 5.1.1, the velocity of the point at time rf is the vector 
v(t) =F) = (4), 9), 


where x(t) and y(t) are the derivatives of x(t) and y(t) with respect to time rf. 
The acceleration a(t) is the rate of change of the vector v(t), so that 


a(t) = W(t) =F) = (4), FO), 


where x(t) and y(t) are the derivatives of the functions x(t) and y(t) with respect 
to time, the so-called second derivatives of x(t) and y(t). 

Thus, in the sense of the physical problem, functions x(t) and y(t) that describe 
the motion of a point mass must have both first and second derivatives. 

In particular, let us consider the uniform motion of a point along a circle of 
radius r. Let w be the angular velocity of the point, that is, the magnitude of the 
central angle over which the point moves in unit time. 

In Cartesian coordinates (by the definitions of the functions cosx and sin x) this 
motion is written in the form 


r(t) = (r cos(wt + a), r sin(wt + @)), 
and if r(0) = (r, 0), it assumes the form 
r(t) = (rcosat,rsinat). 


Without loss of generality in our subsequent deductions, for the sake of brevity, 
we shall assume that r(0) = (r, 0). 
Then by the results of Examples 3 and 4 we have 


v(t) =V(t) = (—rwsinwt, rwcosat). 
From the computation of the inner product 
(v(t), r(t)) = —r’wsinot coswt +r?@cos wt sinwt = 0, 


as one should expect in this case, we find that the velocity vector v(t) is orthogonal 
to the radius-vector r(t) and is therefore directed along the tangent to the circle. 
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Next, for the acceleration, we have 
a(t) = v(t) =r(t) = (-ro cos a@t, —rw sin wt), 
that is, a(t) = —w7r(t), and the acceleration is thus indeed centripetal, since it has 
the direction opposite to that of the radius-vector r(t). 
Moreover, 
hic) a 
Ja(t)| = @*|r()| = 0? = oS ae? 


where v = |v(ft)|. 

Starting from these formulas, let us compute, for example, the speed of a low- 
altitude satellite of the Earth. In this case r equals the radius of the earth, that is, 
r = 6400 km, while |a(t)| = g, where g © 10 m/s? is the acceleration of free fall at 
the surface of the earth. 

Thus, v* = |a(t)|r © 10 m/s? x 64 x 10° m= 64 x 10° (m/s), and so v © 
8 x 10° m/s. 


Example 6 (The optic property of a parabolic mirror) Let us consider the parabola 


y= ape (p > 0, see Fig. 5.4), and construct the tangent to it at the point (xo, yo) = 


1 
(xo, ap%0): 
Since f(x) = ap* we have 


1 2 142 
5X — EX l 
f(a) = lim 7? 2p") _ — tim (x +.x9) = —x0. 
xX—>XxQ x — x9 2p x—>Xx9 Dp 


Hence the required tangent has the equation 


1 
2p 


2_ 1 
y XQ = —X0(x — Xo) 
Pp 


or 


1 
ao xo) — (y — yo) = 9, (5.25) 


where yo = ap*a: 
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The vector n = (— 1x, 1), as can be seen from this last equation, is orthogonal 
to the line whose equation is (5.25). We shall show that the vectors ey = (0, 1) 
and ef = (—x0, f — yo) form equal angles with n. The vector ey is a unit vector 
directed along the y-axis, while e is directed from the point of tangency (xo, yo) = 
(xo, 7%) to the point (0, 2), which is the focus of the parabola. Thus 


(ey,n) 1 
cos eyn = =a 
ley|[n| | 
I 2 Pp I 2 Pp 1.2 
(e7, n) p*0 + 2 — pO z+ 75% 1 
cose sn = 


aciet = = Le T2\2 In| 
levi inj xe +B — shad? im + bad? nl 


Thus we have shown that a wave source located at the point (0, 4), the focus of 
the parabola, will emit a ray parallel to the axis of the mirror (the y-axis), and that 
a wave arriving parallel to the axis of the mirror will pass through the focus (see 
Fig. 5.4). 


Example 7 With this example we shall show that the tangent is merely the best 
linear approximation to the graph of a function in a neighborhood of the point of 
tangency and does not necessarily have only one point in common with the curve, 
as was the case with a circle, or in general, with convex curves. (For convex curves 
we shall give a separate discussion.) 

Let the function be given by 


x?sint, ifx 40 
x = x? ’ 
ie 0 ifx =0. 


The graph of this function is shown by the thick line in Fig. 5.5. 
Let us find the tangent to the graph at the point (0, 0). Since 


; x sin 1 —0 . 1 
Ff ©) = lim ———— = lim x sin- =0, 
x0 x—0 x= x 
the tangent has the equation y —0 = 0- (x — 0), or simply y = 0. 
Thus, in this example the tangent is the x-axis, which the graph intersects in- 
finitely many times in any neighborhood of the point of tangency. 
By the definition of differentiability of a function f : E — R at a point x9 € E, 
we have 


Ff) — fo) = A(xo) (x — x9) tO(X — XQ) asx x0,x Ee E. 


Since the right-hand side of this equality tends to zero as x > xo, x € E, it 
follows that limgzs;—x) f(x) = f(%o), so that a function that is differentiable at a 
point is necessarily continuous at that point. 

We shall show that the converse, of course, is not always true. 
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Example 8 Let f(x) = |x|, (Fig. 5.6). Then at the point x9 = 0 we have 


_  f£()— fo) . Ai=0 co: 

lim =—— = lim ——e= lim —=-l, 
x—>x9—0 xX — X09 x>-0 x -—0 x>-0 X 

_ Sf) — fo) -. l= 0 a: 

lim =———— = lim ——e= lim —-=1. 
x—>x9t+0 x — XO x>+0 x —0 x—>+0 xX 


Consequently, at this point the function has no derivative and hence is not differ- 
entiable at the point. 


Example 9 We shall show that e*+” — e* = e*h + o(h) ash = 0. 
Thus, the function exp(x) = e” is differentiable and dexp(x)h = exp(x)h, or 
de* = e* dx, and therefore exp’ x = exp x, or ae =e, 
Proof 
ev th _ e% —e%(e# — 1) =e*(h + o0(h)) =e"h +0(h). 


Here we have used the formula e? — 1 = h + o(h) obtained in Example 39 of 
Sect. 3.2.4. 


Example 10 If a > 0, then a**" — a* = a" (Ina)h + o(h) as h + O. Thus da* = 
a* (Ina) dx and 4 = a* Ina. 
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qth _ge= a* (a! = 1) =a*(e"n = 1) = 


= a*(h Ina+o(h Ina)) =a‘ (Ina)jh+o(h) ash—0. 


Example 11 If x £0, then In|x +h|—In|x| = 4h +0(h) as h — 0. Thus dIn|x| = 


! ay and dIn |x| = a 
x dx x 


Proof 


In|x +A] — In|x| =In 


: 
1+-). 
x 


For |h| < |x| we have |1 + a =1+ A and so for sufficiently small values of i we 
can write 


h\ h h\ 1 
In|x +h| —In[x|=In( 1+ —)=—+0( —) =-h+o0(h) 
x Xx x x 


as h — 0. Here we have used the relation In(1 + t) = t+ o(t) as t > O, shown in 
Example 38 of Sect. 3.2.4. 


Example 12 If x #0 and 0 <a #1, then log, |x +h| —log, |x| = sh + o(h) as 
h—> 0. Thus, dlog, |x|= sho dx and Set — I 
Proof 


log, |x +h| — log, |x| =log, 


h h 
1+ —-|=log,{1+—-—]= 
x x 


1 h 1 (fh h 1 
= —In(1+ = +0 = h+o(h). 
Ina x Ina\x x xIna 


Here we have used the formula for transition from one base of logarithms to 
another and the considerations explained in Example 11. 


5.1.6 Problems and Exercises 


1. Show that 


a) the tangent to the ellipse 


8/4 
i) 
+ 
SI 
Ww] tw 
Il 
= 
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at the point (xo, yo) has the equation 


XX0 | YO, 
gat ee 


b) light rays from a source located at a focus F, = (-Va? — b?,0) or Fo = 
(Va? — b2, 0) of an ellipse with semiaxes a > b > 0 are gathered at the other focus 
by an elliptical mirror. 


2. Write the formulas for approximate computation of the following values: 


a) sin(% +a) for values of a near 0; 
b) sin(30° + @°) for values of a° near 0; 
c) cos(Z +a) for values of a near 0; 
d) cos(45° + @°) for values of a@° near 0. 


3. A glass of water is rotating about its axis at constant angular velocity w. Let 
y = f(x) denote the equation of the curve obtained by cutting the surface of the 
liquid with a plane passing through its axis of rotation. 


a) Show that f’(x) = x, where g is the acceleration of free fall. (See Exam- 
ple 5.) 

b) Choose a function f(x) that satisfies the condition given in part a). (See Ex- 
ample 6.) 

c) Does the condition on the function f(x) given in part a) change if its axis of 
rotation does not coincide with the axis of the glass? 


4. A body that can be regarded as a point mass is sliding down a smooth hill under 
the influence of gravity. The hill is the graph of a differentiable function y = f(x). 


a) Find the horizontal and vertical components of the acceleration vector that 
the body has at the point (xo, yo). 

b) For the case f(x) = x* when the body slides from a great height, find the 
point of the parabola y = x? at which the horizontal component of the acceleration 
is maximal. 


5. Set 


x, 
Wo(x) = 
ola eee ee 


and extend this function to the entire real line so as to have period |. We denote the 


extended function by gq. Further, let 


1 
Qn(x) = qa 00(4"x). 
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The function yg, has period 4~” and a derivative equal to +1 or —1 everywhere 
except at the points x = sot. k €Z. Let 


fx) => 5 on). 
n=1 


Show that the function f is defined and continuous on R, but does not have a 
derivative at any point. (This example is due to the well-known Dutch mathemati- 
cian B.L. van der Waerden (1903-1996). The first examples of continuous functions 
having no derivatives were constructed by Bolzano (1830) and Weierstrass (1860).) 


5.2 The Basic Rules of Differentiation 


Constructing the differential of a given function or, equivalently, the process of find- 
ing its derivative, is called differentiation.’ 


5.2.1 Differentiation and the Arithmetic Operations 


Theorem 1 /f functions f : X — R and g: X — R are differentiable at a point 
x € X, then 


a) their sum is differentiable at x, and 
(f+ 8)'@)=(f' +8)@); 
b) their product is differentiable at x, and 
(f 8) x) = fC) 8) + f(x) - 8'(x); 
c) their quotient is differentiable at x if g(x) #0, and 


FY. f’@)gx) — fog!) 
(x)= : 


g g2(x) 


Proof In the proof we shall rely on the definition of a differentiable function and 
the properties of the symbol o(-) proved in Sect. 3.2.4. 


7Although the problems of finding the differential and finding the derivative are mathematically 
equivalent, the derivative and the differential are nevertheless not the same thing. For that reason, 
for example, there are two terms in French — dérivation, for finding the derivative, and différentia- 
tion, for finding the differential. 
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a) (f+g)xth)—(ft+g)@)= 
= (f(x +h) + g(x +h)) -(f@)+8@)) = 
= (f(x +h) — f(x) + (g@ +h) -8@)) = 
= (f’(x)h t o(h)) + (g/(x)h + 0(h)) = (fe) + 8/(X)) A + o(h) = 


=(f'+2')(@)h+t oh). 
b) f-gath)—(f-g@= 


= f(x tAg(x th) — f(x)g(x) = 
= (f(x) + f’@)ht o(h)) (g(x) + g/(x)h + o(h)) — f(x) g(x) = 
= (f' (g(x) + f(xy’ (x))h + o(h). 


c) Since a function that is differentiable at a point x € X is continuous at that 
point, taking account of the relation g(x) 4 0 and the properties of continuous func- 
tions, we can guarantee that g(x + h) £0 for sufficiently small values of h. In the 
following computations it is assumed that h is small: 


(4)o+m-(2)e- 
g g 


_f@th f@_ 1 
girth) gi) g@e+h) 


=(=; es aay + OU) (FO) + Fh + oUtD)@(4) = 


(f(@ +A)g@x) — f@)g@+h)) = 


— f(x)(g(x) + 9/(x)h + 0(h))) = 
=(a5 Gy: o()) (fg) — f@)g'(@))h + 0(h)) = 


_ f'@)g@) — f)g'(x) 
g°(x) 
Here we have used the continuity of g at the point x and the relation g(x) 4 0 to 
deduce that 


h+o(h). 


1 1 
lim = a! 
h>0 g(x)g(xth) g(x) 


that is, 
1 — 
g(x)g(x +h) g2(x) 


where o(1) is infinitesimal ash > 0,x +he X. 


+o(1), 


Corollary 1 The derivative of a linear combination of differentiable functions 
equals the same linear combination of the derivatives of these functions. 
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Proof Since a constant function is obviously differentiable and has a derivative 
equal to 0 at every point, taking f = const = c in statement b) of Theorem 1, we 
find (cg) (x) =cg’(x). 

Now, using statement a) of Theorem |, we can write 


(ci f +28) (x) = (crf) (&) + (€2g)'(x) = 1 f(x) + 228"). 


Taking account of what has just been proved, we verify by induction that 


(er fi tess ten fn) &) = cr fi (%) ++ ++ + caf @)- 


Corollary 2 If the functions f\,..., f, are differentiable at x, then 


(fie fa) 0) = f[O) AO) fale) + 
+ fie) fox) fax) «+> fae) +--+ + fi) >>> fa-10) F, @)- 


Proof For n= | the statement is obvious. 

If it holds for some n € N, then by statement b) of Theorem | it also holds for 
(n+ 1) €N. By the principle of induction, we conclude that the formula is valid for 
anyneéN. 


Corollary 3 /t follows from the relation between the derivative and the differential 
that Theorem | can also be written in terms of differentials. To be specific: 


a) d(f + g)(x) =df (x) + dg(x); 
b) d(f- 9) = g(x) dfx) + fx) dg (x); 


c) d(£)(x) = EPSP LO if g(x) £0. 


Proof Let us verify, for example, statement a). 


d(f + g)\(xh=(ftg)a)h=(f'+e')@)h= 
=(f/'x) +9 ())h= f'@)ht+e'(h= 
= df (xh + dg(x)h = (df (x) + dg(x))h, 


and we have verified that d(f + g)(x) and df (x) + dg(x) are the same function. 


Example I (Invariance of the definition of velocity) We are now in a position to 
verify that the instantaneous velocity vector of a point mass defined in Sect. 5.1.1 
is independent of the Cartesian coordinate system used to define it. In fact we shall 
verify this for all affine coordinate systems. 
Let (x!, x”) and (¥!, ¥2) be the coordinates of the same point of the plane in two 
different coordinate systems connected by the relations 
gia ax" + ax + b!, 
(5.26) 


= azx' + exe +b’, 
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Since any vector (in affine space) is determined by a pair of points and its co- 
ordinates are the differences of the coordinates of the terminal and initial points of 
the vector, it follows that the coordinates of a given vector in these two coordinate 
systems must be connected by the relations 

pis atu! + we; 


. - (5.27) 


v= aju! +ajv". 

If the law of motion of the point is given by functions x(t) and x2(f) in one 
system of coordinates, it is given in the other system by functions x!(t) and X?(t) 
connected with the first set by relations (5.26). 

Differentiating relations (5.26) with respect to t, we find by the rules for differ- 
entiation 


Fal é . 
x = a,x! + alxz?, 


(5.28) 


22 ‘ 5 
x = ae + Cae 


Thus the coordinates (v!, v) = (x!, x”) of the velocity vector in the first system 


and the coordinates (t!, 5?) = (x x ) of the velocity vector in the second system 
are connected by relations (5.27), telling us that we are dealing with two different 
expressions for the same vector. 


Example 2 Let f(x) = tanx. We shall show that f’(x) = = oe at every point where 
cos x #0, that is, in the domain of definition of the function tanx = Sin 
It was shown in Examples 1 and 2 of Sect. 5.1 that sin’(x) = cosx and cos’ x = 


— sinx, so that by statement c) of Theorem | we find, when cos x 4 0, 


e , . i 
; sin sin’ x cos x — sinx cos’ x 
tan’ x —]()= 


cos cos? x 


cos x cosx + sinx sinx 1 


cos? x cos? x" 


1 


Example 3 cot' x = — wherever sin x 4 0, that is, in the domain of definition 


cosx sin’ x 
of cotx = Sing 
Indeed, 
/ « ‘ 
Fi cos cos’ x sinx — cos x sin’ x 
cot x = | — (x) => =) — 
sin sin” x 
—sinx sinx — cosx cosx 1 
sin? x sin? x 


Example 4 If P(x) =co +. cyx +++++ cyx" is a polynomial, then P’(x) = c, + 
9 sss ene. 
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Indeed, since a = 1, by Corollary 2 we have ax =nx"—!, and the statement 


now follows from Corollary 1. 


5.2.2 Differentiation of a Composite Function (Chain Rule) 


Theorem 2 (Differentiation of a composite function) If the function f :X > Y C 
R is differentiable at a point x € X and the function g : Y > R is differentiable at 
the point y = f (x) € Y, then the composite function go f : X — R is differentiable 
at x, and the differential d(g o f)(x): TR(x) > TR(g(f (x))) of their composition 
equals the composition df (y) o df (x) of their differentials 


df (x): TR) > TR(y=f()) and de(y = f(x): TRO) > TR(¢()). 
Proof The conditions for differentiability of the functions f and g have the form 
fath—fa)=f'(x)h+o(h) ah>0,x+heX, 
syt)-—s=s'O)tt+o@) ast>0, y+reY. 


We remark that in the second equality here the function o(t) can be considered 
to be defined for t = 0, and in the representation o(t) = y(t)t, where y(t) > 0 as 
t > 0, y+t € Y, we may assume y (0) = 0. Setting f(x) = y and f(x +h) =y+t, 
by the differentiability (and hence continuity) of f at the point x we conclude that 
t > O0ash— 0, and if x +h eX, then y +t € Y. By the theorem on the limit of a 
composite function, we now have 


y(f@+h)— f(x) =a(h) +0 ash>0, x+heX, 
and thus if t = f(x +h) — f(x) then 
ot)=y(fa+h)— f())(f@ +A) - f@)) = 
=a(h)(f'(x)h + o(h)) = ah) f' (x)h + a(hyo(h) = 
=o(h)+o(h)=o(h) ash>0,x+heXx, 
(go f(x +h) — (go f)(x) = 8(f +A) - 8(f@) = 
=g(y+t)-—g =e ()tto)= 
=s'(f@)(f@+h) — f@)) +0(f@ +h) — f@)= 
= e'( f(x) (f/@h + o(h)) + o(f (+h) — f@)) = 
= g'(f (x) (fh) + 8'(F(@)) (0) + 0(f & +h) — f()). 


Since we can interpret the quantity g’(f(x))(f’(x)A) as the value dg(f(x)) o 


df (x)h of the composition Ph aa ace) g'(f(x))- f’(x)h of the mappings h yi 
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de(y 
f'(x)h,t Ey g’(y)t at the displacement h, to complete the proof it remains only 
for us to remark that the sum 


g' (f(x) (o(h)) + o(f@ +4) — f(x) 


is infinitesimal compared with h ash > 0,x +h € X, or, as we have already estab- 
lished, 


o( f (x +h) — f (x)) =o(h) ash—0, x+heX. 
Thus we have proved that 
(go ffa+h)—-(goff@= 
= ¢'(f(x))- f'@h+ o(h) ash>0, x +heX. 


Corollary 4 The derivative (g o f)'(x) of the composition of differentiable real- 
valued functions equals the product g'(f (x))- f'(x) of the derivatives of these func- 
tions computed at the corresponding points. 


There is a strong temptation to give a short proof of this last statement in Leibniz’ 
notation for the derivative, in which if z = z(y) and y = y(x), we have 


dz dz dy 


dx dy dx’ 


which appears to be completely natural, if one regards the symbol & or oy not as a 
unit, but as the ratio of dz to dy or dy to dx. 
The idea for a proof that thereby arises is to consider the difference quotient 


Az Az Ay 
Ax Ay Ax 
and then pass to the limit as Ax — 0. The difficulty that arises here (which we also 


have had to deal with in part!) is that Ay may be 0 even if Ax £0. 


Corollary 5 If the composition (fn o--- 0 f\)(x) of differentiable functions y, = 
Si), ---.¥n = fn On-1) exists, then 


(fno---0 fi) (x) = frOn-D Sy—1 On—2) ++ A). 


Proof The statement is obvious if n = 1. 
If it holds for some n € N, then by Theorem 2 it also holds for n + 1, so that by 
the principle of induction, it holds for any n € N. 


Example 5 Let us show that for a € R we have a =ax%—! in the domain x > 0, 
that is, dx” = ax%—! dx and 


(x + h)® —x* =ax*'h+o(h) ash 0. 
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Proof We write x“ = e“'* and apply the theorem, taking account of the results of 
Examples 9 and 11 from Sect. 5.1 and statement b) of Theorem 1. 
Let g(y) =e” and y= f(x) =a In(x). Then x® = (g 0 f)(x) and 


(go fy'(x)=2'()- fi) =e = set. F Saxe, 
XxX X 


Example 6 The derivative of the logarithm of the absolute value of a differentiable 
function is often called its logarithmic derivative. 
Since F(x) = In| f(x)| = Uno] | o f)(x), by Example 11 of Sect. 5.1, we have 
7 ry) — £'@) 
F(x) = (dnl f)'@) = 4. 
Thus 


f(x) 4 AF @) 
x= : 
Fay f@) 


d(In If |) (x) = 


Example 7 (The absolute and relative errors in the value of a differentiable function 
caused by errors in the data for the argument) If the function f is differentiable 
at x, then 


fa +h) — f) = f'@)h +a; h), 


where a(x; h) =o(h) ash > 0. 

Thus, if in computing the value f(x) of a function, the argument x is determined 
with absolute error h, the absolute error | f(x +) — f(x)| in the value of the func- 
tion due to this error in the argument can be replaced for small values of h by the 
absolute value of the differential |d f (x)h| = | f’(x)h| at displacement h. 


If’@rl _ Idf@AI oF as the 


The relative error can then be computed as the ratio iar = fcr 
f'@) 


absolute value of the product | 109) ||h| of the logarithmic derivative of the function 
and the magnitude of the absolute error in the argument. 

We remark by the way that if f(x) = Inx, then dInx = oe and the absolute error 
in determining the value of a logarithm equals the relative error in the argument. 
This circumstance can be beautifully exploited for example, in the slide rule (and 
many other devices with nonuniform scales). To be specific, let us imagine that with 
each point of the real line lying right of zero we connect its coordinate y and write 
it down above the point, while below the point we write the number x = e”. Then 
y =Inx. The same real half-line has now been endowed with a uniform scale y and 
a nonuniform scale x (called logarithmic). To find In x, one need only set the cursor 
on the number x and read the corresponding number y written above it. Since the 
precision in setting the cursor on a particular point is independent of the number x 
or y corresponding to it and is measured by some quantity Ay (the length of the 
interval of possible deviation) on the uniform scale, we shall have approximately 
the same absolute error in determining both a number x and its logarithm y; and 
in determining a number from its logarithm we shall have approximately the same 
relative error in all parts of the scale. 
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Example 8 Let us differentiate a function u(x)", where u(x) and v(x) are dif- 
ferentiable functions and u(x) > 0. We write u(x)8© =e?) !"¥©) and use Corol- 
lary 5. Then 

dev@) Inu@) 


dx 


= eM) Inu) (ve Inu(x) + vy) = 
u(x) 


= u(x)’ . v'(x) Inu(x) + v(x)u(xyP Or! - u(x). 


5.2.3 Differentiation of an Inverse Function 


Theorem 3 (The derivative of an inverse function) Let the functions f : X — Y and 
f7!:Y => X be mutually inverse and continuous at points x9 € X and f (xo) = 
yo € Y respectively. If f is differentiable at xo and f'(x9) #0, then f—! is also 
differentiable at the point yo, and 


(f-')' (0) = (f'@0)) 


Proof Since the functions f : X — Y and f~!: ¥Y — X are mutually inverse, the 
quantities f(x) — f (xo) and f'UW) _ i (yo), where y = f (x), are both nonzero 
if x ~ xo. In addition, we conclude from the continuity of f at x9 and f~! at yo that 
(X 3x —> x0) & (Y 3 y > yo). Now using the theorem on the limit of a composite 
function and the arithmetic properties of the limit, we find 


.« £ Wf OO)... x—x0 
lim = in ———— = 
eer Y— Yo Xax x0 f (x) — f (xo) 
li 1 1 
= im 7 = => . 
X3x—> x9 (Lea foo) f/(%o) 


Thus we have shown that the function f~! : Y > X has a derivative at yo and 
that 


(f-')' Qo) = (f'@0)) 


Remark 1 If we knew in advance that the function f~! was differentiable at yo, 
we would find immediately by the identity (f~! o f)(x) = x and the theorem on 
differentiation of a composite function that (f —ly (yo) f’(xo) = 1. 


Remark 2 The condition f’(xo) 4 0 is obviously equivalent to the statement that 
the mapping ht» f’(xo)h realized by the differential df (xo) : TR(xo) ~ TR(0) 
has the inverse mapping [df (xo)]7! : TR( yo) > TR(xo) given by the formula t 
(f!(x0))7 Ht. 

Hence, in terms of differentials we can write the second statement in Theorem 3 
as follows: 
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If a function f is differentiable at a point xq and its differential df (xq) : 
TR(@o) > TRO) is invertible at that point, then the differential of the function 
f 7! inverse to f exists at the point yo = f (xo) and is the mapping 


df-'(y0) =[df (ao) | : TRO) > TR), 
inverse to df (x9): TR(xo) > TRO). 


1 
—y2 


Example 9 We shall show that arcsin’ y = for |y| < 1. The functions sin: 


y 
[—2/2, 2/2] — [—1, 1] and arcsin : [—1, 1] — [—2/2, 2/2] are mutually inverse 
and continuous (see Example 8 of Sect. 4.2) and sin’(x) = cosx £0 if |x| < 2/2. 
For |x| < 2/2 we have |y| < | for the values y = sinx. Therefore, by Theorem 3 


1 1 1 1 


sin’x  cosx aff = ae x Ps J1l—y2 


arcsin’ y = 


The sign in front of the radical is chosen taking account of the inequality cosx > 0 
for |x| < 7/2. 


Example 10 Reasoning as in the preceding example, one can show (taking account 
of Example 9 of Sect. 4.2) that 


1 
arccos’ y = -—————. for |y| < 1. 
Jl—y? 
Indeed, 
1 1 1 1 
arccos y = = = = : 
Cos! x sinx 1 — cos? x Vl—y? 


The sign in front of the radical is chosen taking account of the inequality sinx > 0 
ifO0<x<zZ. 


1 


Example 11 arctan’ y = Tp ve R. 
Indeed, 
1 , 1 1 
= i = cos oT ae -Ta 7: 
an’ x ( oe ) x y 
Example 12 arccot’ y = eee yeR. 
Indeed : 
1 1 1 
arccot’ y = = =—sin?x = 


colx (ce) l+co2x I+y?" 
s 


5.2 The Basic Rules of Differentiation 199 


Example 13 We already know (see Examples 10 and 12 of Sect. 5.1) that the func- 
tions y = f(x) =a* andx = f—!(y)= log, y have the derivatives f’(x) = a* Ina 
and (f~!)'() = Sag: 

Let us see how this is consistent with Theorem 3: 


_1)/ 1 1 1 
(f ) = St’) ~ at Ina ylna’ 
1 1 
‘x)= = =ylna=a‘* Ina. 
f(x) =o) GH ylna=a* Ina 


Example 14 The hyperbolic and inverse hyperbolic functions and their derivatives. 
The functions 


; iP 
sinhx = 5 (e —e*), 
Wise ed, ks 
coshx = mil +e =) 


are called respectively the hyperbolic sine and hyperbolic cosine® of x. 
These functions, which for the time being have been introduced purely formally, 
arise just as naturally in many problems as the circular functions sinx and cos x. 
We remark that 


sinh(—x) = — sinhx, 


cosh(—x) = coshx, 


that is, the hyperbolic sine is an odd function and the hyperbolic cosine is an even 
function. 
Moreover, the following basic identity is obvious: 


cosh? x — sinh? x = 1. 


The graphs of the functions y = sinhx and y = cosh x are shown in Fig. 5.7. 

It follows from the definition of sinh and the properties of the function e* that 
sinh x is a continuous strictly increasing function mapping R in a one-to-one manner 
onto itself. The inverse function to sinh x thus exists, is defined on R, is continuous, 
and is strictly increasing. 

This inverse is denoted arsinh y (read “area-sine of y”).? This function is easily 
expressed in terms of known functions. In solving the equation 


(ec -e*)=y 


8From the Latin phrases sinus hyperbolici and cosinus hyperbolici. 


°The full name is area sinus hyperbolici (Lat.); the reason for using the term area here instead of 
arc, as in the case of the circular functions, will be explained later. 
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Fig. 5.7 


for x, we find successively 
e=y+/1+y? 


(e* > 0, and soe* A y—./1+ y?) and 


x=In(y+4/1+y?). 
arsinh y =In(y+/1+y?), yeR. 


Thus, 


Similarly, using the monotonicity of the function y = coshx on the two inter- 
vals R_ = {x € R| x < 0} and Ry = {x € R| x => O}, we can construct functions 
arcosh_ y and arcosh, y, defined for y > 1 and inverse to the function coshx on 


R_ and Ry, respectively. 
They are given by the formulas 


arcosh_ y = In(y — \/y?— 1), 
arcoshy y = In(y + yy? — 1). 


From the definitions given above, we find 
sinh’ x = Je: + e~*) =coshx 
2 ’ 
/ 1 x —x ‘ 
cosh x = AG —e ) =sinhx, 


and by the theorem on the derivative of an inverse function, we find 


1 1 1 1 
arsinh’ y = = = = : 
4 sinh’x coshx V1 + sinh2 x nal + y2 
hi! 1 1 1 1 
arcos — = = — ; 
-* = Cosh’x  sinhx ae er ee | /y2 1 


y>l, 
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Me 1 = 1 
cosh’x  sinhx J cosh? x — 1 ~ Jy2-1 
These last three relations can be verified by using the explicit expressions for the 


inverse hyperbolic functions arsinh y and arcosh y. 
For example, 


arcosh’, y = , yol. 


arsinh’ y = 


1 1 -1/2 
1+-(1+y* -2y) = 
ae 3 ) 
_ 1 yey ey 1 
yt+Jfl¢y Jlt+y? 3 JSlt+y? 


Like tanx and cotx one can consider the functions 


sinh x coshx 
and cothx = — : 
cosh x sinh x 


tanhx = 


called the hyperbolic tangent and hyperbolic cotangent respectively, and also the 
functions inverse to them, the area tangent 


1. i+y 
artanh y = —In » bil<l, 
2 1-y 
and the area cotangent 
1 1 
fees a » |yl>l 
2 y-1 


We omit the solutions of the elementary equations that lead to these formulas. 
By the rules for differentiation we have 


; sinh’ x cosh x — sinh x cosh’ x 
tanh’ x = 5 = 
cosh* x 


coshx coshx — sinhx sinhx 1 


—* ’ 
cosh? x cosh? x 


F cosh’ x sinh x — cosh x sinh’ x 
coth x = aa = 
sinh* x 


sinh x sinh x — coshx coshx 1 


sinh? x sinh? x 
By the theorem on the derivative of an inverse function 
/ 1 1 2 
artanh’ y = = = cosh’ x = 


tanh’ x eee 
( cosh? x 


1 


~ 1—tanh?x  1— y2’ 


ly| <1, 
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' 1 al 
arcoth y = == i = —sinh*x = 
cothx (——, 
sinh* x 


1 sy 
coth?x—1 y 


The last two formulas can also be verified by direct differentiation of the explicit 
formulas for the functions artanh y and arcoth y. 


5.2.4 Table of Derivatives of the Basic Elementary Functions 


We now write out (see Table 5.1) the derivatives of the basic elementary functions 
computed in Sects. 5.1 and 5.2. 


5.2.5 Differentiation of a Very Simple Implicit Function 


Let y = y(t) and x = x(t) be differentiable functions defined in a neighborhood 
U (to) of a point fo € R. Assume that the function x = x(f) has an inverse f = f(x) 
defined in a neighborhood V (xo) of x9 = x(to). Then the quantity y = y(t), which 
depends on f, can also be regarded as an implicit function of x, since y(t) = y(t(x)). 
Let us find the derivative of this function with respect to x at the point xo, assuming 
that x’(t9) 4 0. Using the theorem on the differentiation of a composite function and 
the theorem on differentiation of an inverse function, we obtain 


d 
on dy(t(x))| = dy(t)|— dt) Prato yf (t0) 
Xlx=xg dx x=xQ dt t=to dx xX=XxQ ao lt=to Xt (to) 


(Here we have used the standard notation f(x)|y=x,) ‘= f(%o).) 

If the same quantity is regarded as a function of different arguments, in order to 
avoid misunderstandings in differentiation, we indicate explicitly the variable with 
respect to which the differentiation is carried out, as we have done here. 


Example 15 (The law of addition of velocities) The motion of a point along a line 
is completely determined if we know the coordinate x of the point in our chosen 
coordinate system (the real line) at each instant ¢ in a system we have chosen for 
measuring time. Thus the pair of numbers (x, t) determines the position of the point 
in space and time. The law of motion is written in the form of a function x = x(t). 
Suppose we wish to express the motion of this point in terms of a different co- 
ordinate system (X,7). For example, the new real line may be moving uniformly 
with speed —v relative to the first system. (The velocity vector in this case may be 
identified with the single number that defines it.) For simplicity we shall assume 
that the coordinates (0,0) refer to the same point in both systems; more precisely, 
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Table 5.1 
Function f (x) Derivative f’(x) Restrictions on domain of x € R 
1. C (const) 0 
2.x axe! x>OforaeR 
xé€RforaeN 

3.a* a‘ \|na xER(a>0,aF41) 
4. log, |x| aw x ER\O(a>0,a41) 
5. sinx cos x 
6. cos x —sinx 
7. tanx area xAG+tkK KEL 

1 
8. cotx aoe xAmk,keZ 
9. arcsin x ! |x| <1 

1—x? 

10. arccos x Saal |x| <1 

1—x? 
11. arctan x a 

1 
12. arccot x aoe weed 
13. sinhx cosh x 
14. coshx sinh x 

1 

15. tanh x costs 
16. cothx oe nae x40 
17. arsinhx = In(x + V1 4+ x?) a 
18. arcoshx = In(x + Vx? — 1) + |x| >1 

om 
19. artanhx = 5 In tS a |x| <1 
20. arcothx = 5 In a wy |x| >1 


that at time ¢ = 0 the point x = 0 coincided with the point x = 0 at which the clock 
showed t = 0. 

Then one of the possible connections between the coordinate systems (x, ft) and 
(x,¢) describing the motion of the same point observed from different coordinate 
systems is provided by the classical Galilean transformations: 


x=x-+u0t, 
(5.29) 
t=t 
Let us consider a somewhat more general linear connection 
x=ax-+ ft, 
= (5.30) 
t=yx-+ot, 


assuming, of course, that this connection is invertible, that is, the determinant of the 


matrix ( :) is not zero. 
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Let x = x(t) and x = X(f) be the law of motion for the point under observation, 
written in these coordinate systems. 
We remark that, knowing the relation x = x(t), we find by formula (5.30) that 


x(t) =ax(t) + Br, 


. (5.31) 
i(t) = yx(t) +41, 
and since the transformation (5.30) is invertible, after writing 
x = ak + Bi, 
es (5.32) 
t=yx+6t, 
knowing x = X(f), we find 
x(t) =G%0) + Bi, 
(5.33) 


t(f) = px(f) + SF. 


It is clear from relations (5.31) and (5.33) that for the given point there exist 
mutually inverse functions f = f(t) and t = t(f). 
We now consider the problem of the connection between the velocities 


OM 2 iy. gal Vj =e 


V@Q)= = 
© dt dt 


of the point computed in the coordinate systems (x, t) and (x, ¢) respectively. 
Using the rule for differentiating an implicit function and formula (5.31), we have 


di F af +B 
== at 
dc tf yGts 
or 
io V(t 
(oes (5.34) 
yV+5 


where ¢ and ¢ are the coordinates of the same instant of time in the systems (x, f) 
and (x, 7). This is always to be kept in mind in the abbreviated notation 


aV+B 


V= 
yV+6 


(5.35) 


for formula (5.34). 
In the case of the Galilean transformations (5.29) we obtain the classical law of 
addition of velocities from formula (5.35) 


V=V+u. (5.36) 
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It has been established experimentally with a high degree of precision (and this 
became one of the postulates of the special theory of relativity) that in a vacuum 
light propagates with a certain velocity c that is independent of the state of motion 
of the radiating body. This means that if an explosion occurs at time t = 7 = 0 at 
the point x = x = 0, the light will reach the points x with coordinates such that 
x* = (ct)? after time ¢ in the coordinate system (x,t), while in the system (x, 7) 
this event will correspond to time 7 and coordinates *, where again <7 = (cf)*. 

Thus, if x2 — c?#? =0, then ¥2 — cf? =0 also, and conversely. By virtue of 
certain additional physical considerations, one must consider that, in general 


ee ae Ser, (5.37) 
if (x, t) and (%, f) correspond to the same event in the different coordinate systems 
connected by relation (5.30). Conditions (5.37) give the following relations on the 
coefficients a, 6, y, and 6 of the transformation (5.30): 


ae cy? = 
aB —c*y5 =0, (5.38) 
B? — c76* = —c’. 


If c= 1, we would have, instead of (5.38), 


a —y?= 1, 

BY 

ee (5.39) 
ae —l, 


from which it follows easily that the general solution of (5.39) (up to a change of 
sign in the pairs (a, 6) and (y, 6)) can be given as 


a =coshg, y =sinhg, B=sinhg, 6 =coshg, 


where ¢ is a parameter. 
The general solution of the system (5.38) then has the form 


a B\  (coshg csinhg 
y 5) ; sinhg coshg 
and the transformation (5.30) can be made specific: 
x =coshgx + csinhgt, 
(5.40) 


= il 
t = —sinhgx + coshgt. 
c 


This is the Lorentz transformation. 
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In order to clarify the way in which the free parameter g is determined, we recall 
that the x-axis is moving with speed —v relative to the x-axis, that is, the point x = 0 
of this axis, when observed in the system (x, t) has velocity —v. Setting x = 0 in 
(5.40), we find its law of motion in the system (x, f): 


x =-—ctanhgt. 
Therefore, 


v 
tanhg = -. 
c 


(5.41) 


Comparing the general law (5.35) of transformation of velocities with the Lorentz 
transformation (5.40), we obtain 


Te coshgV + csinhge 


1 sinhgV +coshg’ 


or, taking account of (5.41), 


(5.42) 


Formula (5.42) is the relativistic law of addition of velocities, which for 
juVl< c, that is, as c + oo, becomes the classical law expressed by formula 
(5.36). 

The Lorentz transformation (5.40) itself can be rewritten taking account of rela- 
tion (5.41) in the following more natural form: 


x + ut 


Uv 
t+ ax 


x= 


(5.43) 


from which one can see that for |v| < c, that is, as c > oo, they become the classical 
Galilean transformations (5.29). 


5.2.6 Higher-Order Derivatives 


If a function f : E > R is differentiable at every point x € FE, then a new function 
f': E — R arises, whose value at a point x € E equals the derivative f’(x) of the 
function f at that point. 

The function f’ : E + R may itself have a derivative (f’)’ : E > R on E, called 
the second derivative of the original function f and denoted by one of the following 
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two symbols: 
& f(x) 
dx? 


and if we wish to indicate explicitly the variable of differentiation in the first case, 
we also write, for example, f/’ (x). 


f"), 


’ 


Definition By induction, if the derivative f“~!)(x) of order n — 1 of f has been 
defined, then the derivative of order n is defined by the formula 


FOC) = (FOP). 
The following notations are conventional for the derivative of order n: 


a fe), 


dx” 


f@), 


Also by convention, f Ox) r= f(). 

The set of functions f : E — R having continuous derivatives up to order 
n inclusive will be denoted C”(E, IR), and by the simpler symbol C(E), or 
C"(E,R) and C”(E) respectively wherever no confusion can arise. 

In particular C (E) =C(E) by our convention that f Ox) = Ff (x). 

Let us now consider some examples of the computation of higher-order deriva- 
tives. 


Examples 
f(x) f'@) f" (x) os f(x) 
16) a* a*|na a*In2a tee a* In" a 
17) e* e* e* tee e* 
18) sinx COs x — sinx sae sin(x + n7/2) 
19) cos x —sinx —Ccosx tee cos(x + nz /2) 
20) (+x)® atx)! a@—-D+x)22 (a —1)--- (@ nt DA +x)" 
21) xe ax! a(a — 1)x%~2 ee a(a—1)-+-(a—n+1)x%" 
22) — logy |x| ay) an ee ne (=D! 
23) In|x| x7! (-l)x7? Pe (-1)""! = Dix7? 


Example 24 (Leibniz’ formula) Let u(x) and v(x) be functions having derivatives 
up to order 1 inclusive on a common set E. The following formula of Leibniz holds 
for the nth derivative of their product: 


n 


(uv) = » C) ym) yim) (5.44) 


m=0 


Leibniz’ formula bears a strong resemblance to Newton’s binomial formula, and 
in fact the two are directly connected. 
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Proof For n = 1 formula (5.44) agrees with the rule already established for the 
derivative of a product. 

If the functions u and v have derivatives up to order n + | inclusive, then, assum- 
ing that formula (5.44) holds for order n, after differentiating the left- and right-hand 
sides, we find 


n n 
(uvy@t) = > (7) yarrmtD ym) 4 3 ) yarm) ym) _ 


m=0 m=0 
n 
=u Dy 4 > ((Z) 4 Ge :)) y@t)-b) yO) 4 yO) y+) — 
k=1 

n+l 
=> (" + ') ylrt)-¥) 
= i 

k=0 


Here we have combined the terms containing like products of derivatives of the 


functions wu and v and used the binomial relation (7) + ( ny) = eas ): 


Thus by induction we have established the validity of Leibniz’ formula. 


Example 25 If Py(x) = co + c1x +--+ + ¢nx", then 
Pn (0) = co, 
Pl (x) =e) + 2cgx +--+ +nepx"—! and P/ (0) =c1, 
PY (x) = 2c2 +3+2c3x +--+ +n(n — Dcnx"~* and P” (0) = 2!co, 
PE) (x) =3-2c3+-+-n(n — In — 2enx"~3 and P® (0) = 3!c3, 


P® (x) =n(n — 1)(n — 2)-++2en and P™ (0) =nlen, 
P®(x)=0 fork>n. 


Thus, the polynomial P, (x) can be written as 


1 1 1 
P, (x) = POO) + qn Ox fe ain (O)x2 -..-4 P™ (O)x". 


Example 26 Using Leibniz’ formula and the fact that all the derivatives of a poly- 


nomial of order higher than the degree of the polynomial are zero, we can find the 
nth derivative of f(x) = x? sinx: 


f(x) = sin™ (x) x? + i) sin®—) x. 2x + a) sin?) x.2= 


2: Uv : 18 
=x" sin ttn, + 2nx sin z+ (a—I)> + 
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+ (-nn _ isin +n3)) = 
= (a —n(n—1)) sin( +3) —2nx cos(x +n5). 


Example 27 Let f(x) = arctan x. Let us find the values f™(O) (n=1,2,...). 
Since f’(x) = , it follows that (1 + x7) f’(x) = 1. 
4 1+ x2 


Applying Leibniz’ formula to this last equality, we find the recursion relation 


(1+.x7) fOTP (x) + 2nxf™ (x) +0(n— 1) f° (x) =0 


from which one can successively find all the derivatives of f(x). 
Setting x = 0, we obtain 


FOV G) ee 1 f"-Y0). 


For n = 1| we find f (2)(0) = 0, and therefore f (2n) (0) = 0. For derivatives of odd 
order we have 


fP"tD CO) = —2m(2m — 1) f P"—-Y (0) 
and since f’(0) = 1, we obtain 


f+) ) = (-1)"(2m)!. 


Example 28 (Acceleration) If x = x(t) denotes the time dependence of a point mass 
moving along the real line, then a = x(t) is the velocity of the point, and then 


2, 
GO — g Lee X(t) is its acceleration at time f. 


If x(t) =at+ B, then x(t) = a@ and x(t) = 0, that is, the acceleration in a uniform 
motion is zero. We shall soon verify that if the second derivative equals zero, then the 
function itself has the form at + f. Thus, in uniform motions, and only in uniform 
motions, is the acceleration equal to zero. 

But if we wish for a body moving under inertia in empty space to move uniformly 
in a straight line when observed in two different coordinate systems, it is necessary 
for the transition formulas from one inertial system to the other to be linear. That 
is the reason why, in Example 15, the linear formulas (5.30) were chosen for the 
coordinate transformations. 


Example 29 (The second derivative of a simple implicit function) Let y = y(t) and 
x = x(t) be twice-differentiable functions. Assume that the function x = x(t) has a 
differentiable inverse function t = t(x). Then the quantity y(t) can be regarded as 
an implicit function of x, since y = y(t) = y(t(x)). Let us find the second derivative 
yyy assuming that x’(t) 4 0. 

By the rule for differentiating such a function, studied in Sect. 5.2.5, we have 
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so that 
/ 
Yryr YeeXr—VeX tt 
1y\I of oe ei rt ny 
y" ( ai OW er Cp? Xe Ver — Xe Vt 
xx ~ \¥x)y a 1S, / ae / 
Xt Xt Xt Gy 


We remark that the explicit expressions for all the functions that occur here, 
including y;’,, depend on ¢, but they make it possible to obtain the value of yj, at 
the particular point x after substituting for ¢ the value t = t (x) corresponding to the 
value x. 

For example, if y =e! and x = Inf, then 


io Ee y! _ Ot _ et tte’ 
aK Aft , came 1/t 


=t(t+De’. 


We have deliberately chosen this simple example, in which one can explicitly 
express ¢ in terms of x as t = e* and, by substituting t = e* into y(t) =e’, find 
the explicit dependence of y = e® on x. Differentiating this last function, one can 
justify the results obtained above. 

It is clear that in this way one can find the derivatives of any order by successively 
applying the formula 


-1 
(n) cae : t 
Mi Se x 
5.2.7 Problems and Exercises 
1. Let ag, a, ...,@, be given real numbers. Exhibit a polynomial P,,(x) of degree 
n having the derivatives p& (xo) =ax,k =0,1,...,n, at a given point x9 € R. 


2. Compute f’(x) if 


1 
exp(— <5) for x £0, 
a x)= 
nee) f for x = 0; 
b) f(x) x? sin for x £0, 
x)= 
0 forx =0. 


c) Verify that the function in part a) is infinitely differentiable on R, and that 
f™(O)=0. 
d) Show that the derivative in part b) is defined on R but is not a continuous 
function on R. 
e) Show that the function 
fa)= ae ae for —l<x<l, 
0 


for 1 < |x| 


is infinitely differentiable on R. 
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3. Let f € C‘)(R). Show that for x 4 0 


1 f(2) =f 1)" d” ("4(<)) 
xn x dx" xJ} 


4. Let f be a differentiable function on R. Show that 


a) if f is an even function, then f’ is an odd function; 
b) if f is an odd function, then f’ is an even function; 
c) (f’ is odd) > (f is even). 


5. Show that 


a) the function f(x) is differentiable at the point xo if and only if f(x) — 
Ff (xo) = g(x) (% — x0), where g(x) is a function that is continuous at xg (and in 
that case y(xo) = f’ (xo)); 

b) if f(x) — f (xo) = g(x) (x — xo) and g € C"—) (U(x)), where U(xo) is a 
neighborhood of xg, then f(x) has a derivative (f”) (xo)) of order n at xg. 


6. Give an example showing that the assumption that f—! be continuous at the point 
yo cannot be omitted from Theorem 3. 

7. a) Two bodies with masses m, and m2 respectively are moving in space under 
the action of their mutual gravitation alone. Using Newton’s laws (formulas (5.1) 
and (5.2) of Sect. 5.1), verify that the quantity 


1 , 1 > mim? 
E= preity Bat, + er =K+U, 


where v, and v2 are the velocities of the bodies and r the distance between them, 
does not vary during this motion. 
b) Give a physical interpretation of the quantity E = K + U and its components. 
c) Extend this result to the case of the motion of n bodies. 


5.3 The Basic Theorems of Differential Calculus 
5.3.1 Fermat’s Lemma and Rolle’s Theorem 


Definition 1 A point xo € E C R is called a local maximum (resp. local minimum) 
and the value of a function f : E — R at that point a local maximum value (resp. 
local minimum value) if there exists a neighborhood Uf (xo) of xo in E such that at 
any point x € Ug(xo) we have f(x) < f(xo) (resp. f(x) => f(x%o)). 


Definition 2 If the strict inequality f(x) < f(xo) (resp. f(x) > f(xo)) holds at 
every point x € Ug(xo)\x0 = U E(x), the point xo is called strict local maximum 
(resp. strict local minimum) and the value of the function f : E > R a strict local 
maximum value (resp. strict local minimum value). 
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Fig. 5.8 
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Definition 3 The local maxima and minima are called local extrema and the values 
of the function at these points local extreme values of the function. 


Example I Let 


fe) x*, if -1<x <2, 
x)= 
4, if2<x 
(see Fig. 5.8). For this function 
x = —1 is a strict local maximum; 


x = 0 is a strict local minimum; 
x = 2 isa local maximum; 


the points x > 2 are all local extrema, being simultaneously maxima and minima, 
since the function is locally constant at these points. 


Example 2 Let f(x) =sin + on the set E =R\0. 
The points x = (5 + 2km)—!, k € Z, are strict local maxima, and the points 
— (=> +2k2)~!, k € Z, are strict local minima for Ff (x) (see Fig. 4.1). 


Definition 4 An extremum x9 € E of the function f : E > R is called an in- 
terior extremum if xo is a limit point of both sets E_ = {x € E | x < xo} and 
Ex, ={xeE|x>xo}. 


In Example 2, all the extrema are interior extrema, while in Example | the point 
x = —1 is not an interior extremum. 


Lemma 1 (Fermat) Jf a function f : E — R is differentiable at an interior ex- 
tremum, xo € E, then its derivative at xo is 0: f’(xo) =0. 


Proof By definition of differentiability at x9 we have 


f(x0 +h) — f (x0) = f' (xo)h + a(x; AYA, 


where @(x93h) > Oash>x,xp +heE. 
Let us rewrite this relation as follows: 


f (x0 +h) — f (x0) = Lf’ 0) + (0; hy A. (5.45) 
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Since xg is an extremum, the left-hand side of Eq. (5.45) is either non-negative 
or nonpositive for all values of h sufficiently close to 0 and for which x9 +h e E. 

If f’(xo) #0, then for h sufficiently close to 0 the quantity f’(xo) + a(xo; h) 
would have the same sign as f’(xo), since a(xo;h) > Oash>0, x0 +heE E. 

But the value of / can be both positive or negative, given that xg is an interior 
extremum. 

Thus, assuming that f’(xo) 4 0, we find that the right-hand side of (5.45) 
changes sign when hh does (for h sufficiently close to 0), while the left-hand side 
cannot change sign when h is sufficiently close to 0. This contradiction completes 
the proof. 


Remarks on Fermat’s Lemma 1°, Fermat’s lemma thus gives a necessary condition 
for an interior extremum of a differentiable function. For noninterior extrema (such 
as the point x = —1 in Example 1) it is generally not true that f’(xo) = 0. 

2°. Geometrically this lemma is obvious, since it asserts that at an extremum of a 
differentiable function the tangent to its graph is horizontal. (After all, f’(xo) is the 
tangent of the angle the tangent line makes with the x-axis.) 

3°. Physically this lemma means that in motion along a line the velocity must be 
zero at the instant when the direction reverses (which is an extremum!). 

This lemma and the theorem on the maximum (or minimum) of a continuous 
function on a closed interval together imply the following proposition. 


Proposition 1 (Rolle’s!° theorem) If a function f : [a,b] — R is continuous on a 
closed interval (a, b] and differentiable on the open interval la, b[ and f (a) = f(b), 
then there exists a point & € Ja, b[ such that f'(&) =0. 


Proof Since the function f is continuous on [a, b], there exist points x,xy € 
[a, b] at which it assumes its minimal and maximal values respectively. If f (xm) = 
f (xm), then the function is constant on [a, b]; and since in that case f’(x) =0, the 
assertion is obviously true. If f(x) < f(xy), then, since f(a) = f(b), one of the 
points x,, and xj must lie in the open interval Ja, b[. We denote it by €. Fermat’s 
lemma now implies that f’(€) = 0. 


5.3.2 The Theorems of Lagrange and Cauchy on Finite 
Increments 


The following proposition is one of the most frequently used and important methods 
of studying numerical-valued functions. 


Theorem 1 (Lagrange’s finite-increment theorem) Jf a function f : [a,b] > R is 
continuous on a closed interval [a, b| and differentiable on the open interval, |a, bl, 


10M. Rolle (1652-1719) — French mathematician. 
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Fig. 5.9 y 


there exists a point € € ja, b[ such that 


fb) — f@= f' EG —a). (5.46) 


Proof Consider the auxiliary function 


b) — 
F(x) = f(x) LO~ LO, a), 


which is obviously continuous on the closed interval [a, b] and differentiable on the 
open interval Ja, b[ and has equal values at the endpoints: F(a) = F(b) = f(a). 
Applying Rolle’s theorem to F(x), we find a point € € Ja, b[ at which 


foe)—f@_ 
POLO) 


F'(—) = f'(E) — 0. 


Remarks on Lagrange’s Theorem 1°. In geometric language Lagrange’s theorem 
means (see Fig. 5.9) that at some point (€, f(€)), where & € Ja, b[, the tangent to 
the graph of the function is parallel to the chord joining the points (a, f(a)) and 
(b, f (b)), since the slope of the chord equals LO f@) 


2°. If x is interpreted as time and f(b) — f(a) as the amount of displacement 
over the time b — a of a particle moving along a line, Lagrange’s theorem says that 
the velocity f’(x) of the particle at some time & € Ja, b[ is such that if the particle 
had moved with the constant velocity f’(€) over the whole time interval, it would 
have been displaced by the same amount f(b) — f(a). It is natural to call f’(&) the 
average velocity over the time interval [a, b]. 

3°. We note nevertheless that for motion that is not along a straight line there 
may be no average speed in the sense of Remark 2°. Indeed, suppose the particle is 
moving around a circle of unit radius at constant angular velocity m = 1. Its law of 
motion, as we know, can be written as 


r(t) = (cost, sinf). 
Then 
r(t) = v(t) = (— sint, cost) 


and |y| = V sin? t + cos?¢ = 1. 
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The particle is at the same point r(O) = r(277) = (1, 0) at times t = 0 and t = 27, 
and the equality 


r(27) — r(0) = v(é) (21 — 0) 


would mean that v(é) = 0. But this is impossible. 

Even so, we shall learn that there is still a relation between the displacement over 
a time interval and the velocity. It consists of the fact that the full length L of the 
path traversed cannot exceed the maximal absolute value of the velocity multiplied 
by the time interval of the displacement. What has just been said can be written in 
the following more precise form: 


|r(b) —r(a)| < sup |r(t)||b — al. (5.47) 
teja,b[ 


As will be shown later, this natural inequality does indeed always hold. It is also 
called Lagrange’s finite-increment theorem, while relation (5.46), which is valid 
only for numerical-valued functions, is often called the Lagrange mean-value the- 
orem (the role of the mean in this case is played by both the value f’(&) of the 
velocity and by the point € between a and b). 

4°. Lagrange’s theorem is important in that it connects the increment of a func- 
tion over a finite interval with the derivative of the function on that interval. Up to 
now we have not had such a theorem on finite increments and have characterized 
only the local (infinitesimal) increment of a function in terms of the derivative or 
differential at a given point. 


Corollaries of Lagrange’s Theorem 


Corollary 1 (Criterion for monotonicity of a function) [f the derivative of a function 
is nonnegative (resp. positive) at every point of an open interval, then the function is 
nondecreasing (resp. increasing) on that interval. 


Proof Indeed, if x; and x2 are two points of the interval and x; < x2, that is, x2 — 
x1 > 0, then by formula (5.46) 


f (x2) — f (1) = f'(E) (42 — x1), where x1 < & < x2, 


and therefore, the sign of the difference on the left-hand side of this equality is the 
same as the sign of f’(&). 


Naturally an analogous assertion can be made about the nonincreasing (resp. 
decreasing) nature of a function with a nonpositive (resp. negative) derivative. 


Remark By the inverse function theorem and Corollary | we can conclude, in par- 
ticular, that if a numerical-valued function f(x) on some interval J has a derivative 
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that is always positive or always negative, then the function is continuous and mono- 
tonic on J and has an inverse function f~! that is defined on the interval J’ = f (/) 
and is differentiable on it. 


Corollary 2 (Criterion for a function to be constant) A function that is continuous 
on a closed interval [a, b] is constant on it if and only if its derivative equals zero at 
every point of the interval [a, b] (or only the open interval Ja, b{). 


Proof Only the fact that f’(x) =0 on Ja, b[ implies that f(x) = f (x2) for all 
x1, x2, € [a, b] is of interest. But this follows from Lagrange’s formula, according 
to which 


f (x2) — fa) = f'€)@2 — x1) =0, 


since & lies between x; and x2, that is, € € Ja, b[, and so f’(é) =0. 


Remark From this we can draw the following conclusion (which as we shall see, 
is very important for integral calculus): Jf the derivatives Fj (x) and F}(x) of two 
functions F\(x) and Fy(x) are equal on some interval, that is, F; (x) = F3(x) on 
the interval, then the difference F\(x) — F(x) is constant. 


The following proposition is a useful generalization of Lagrange’s theorem, and 
is also based on Rolle’s theorem. 


Proposition 2 (Cauchy’s finite-increment theorem) Let x = x(t) and y = y(t) be 
functions that are continuous on a closed interval [a, B] and differentiable on the 
open interval |a, B[. 

Then there exists a point t € [a, B] such that 


x'(t)(y(B) — y@)) = y'(x)(x(B) — x(@)). 


If in addition x’ (t) 4 0 for eacht € Ja, BL, then x(a) 4 x(B) and we have the equal- 
ity 
y(B) — y(@) _ ¥'@ 
x(B)— x(a) x!(t) 


Proof The function F(t) = x(t)(y(B) — y(@)) — y(t)(x(B) — x(@)) satisfies the 
hypotheses of Rolle’s theorem on the closed interval [a, 6]. Therefore there exists 
a point t € Ja, B[ at which F’(t) = 0, which is equivalent to the equality to be 
proved. To obtain relation (5.48) from it, it remains only to observe that if x’(t) 40 
on Ja, B[, then x(a) 4 x(B), again by Rolle’s theorem. 


(5.48) 


Remarks on Cauchy’s Theorem 1°. If we regard the pair x(t), y(t) as the law 
of motion of a particle, then (x’(t), y’(t)) is its velocity vector at time t, and 
(x(B) — x(a), y(B) — y(q@)) is its displacement vector over the time interval [a, 6]. 
The theorem then asserts that at some instant of time t € [a, 8] these two vectors 
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are collinear. However, this fact, which applies to motion in a plane, is the same kind 
of pleasant exception as the mean-velocity theorem in the case of motion along a 
line. Indeed, imagine a particle moving at uniform speed along a helix. Its velocity 
makes a constant nonzero angle with the vertical, while the displacement vector can 
be purely vertical (after one complete turn). 

2°. Lagrange’s formula can be obtained from Cauchy’s by setting x = x(t) =¢, 
y(t) = v(x) = f(x), a=a, B=b. 


5.3.3 Taylor’s Formula 


From the amount of differential calculus that has been explained up to this point 
one may obtain the correct impression that the more derivatives of two functions 
coincide (including the derivative of zeroth order) at a point, the better these func- 
tions approximate each other in a neighborhood of that point. We have mostly been 
interested in approximations of a function in the neighborhood of a point by a poly- 
nomial P,(x) = Py(xo; x) = co + c1(x — x9) +--+ + n(x — x0)”, and that will 
continue to be our main interest. We know (see Example 25 in Sect. 5.2.6) that an 
algebraic polynomial can be represented as 


: (n) 
Py (2) = Py(t0) + 22 (x — x0) +--+ EO) 


(x ~ xo)” ’ 


& 
that is, cx = “0 (k =0, 1,...,). This can easily be verified directly. 
Thus, if we are given a function f(x) having derivatives up to order 1 inclusive 
at x9, we can immediately write the polynomial 


Pn(xo; X) = Pa(x) = f (x0) + (x — x0)", (5.49) 


/ (n) 
LOO (xo) ttf =o 


whose derivatives up to order 7 inclusive at the point xg are the same as the corre- 
sponding derivatives of f(x) at that point. 


Definition 5 The algebraic polynomial given by (5.49) is the Taylor!! polynomial 
of order n of f(x) at xo. 


We shall be interested in the value of 


f(x) — Py (%03 x) = rn (X03 x) (5.50) 


of the discrepancy between the polynomial P,, (x) and the function f(x), which is 
often called the remainder, more precisely, the nth remainder or the nth remainder 


Np, Taylor (1685-1731) — British mathematician. 
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term in Taylor’s formula: 


f' (x0) f™ (xo) 
1! 


f(x) = fo) Bag) pe ee xo)" +n (xo; x). | (5.51) 


The equality (5.51) itself is of course of no interest if we know nothing more 
about the function r, (xo; x) than its definition (5.50). 

We shall now use a highly artificial device to obtain information on the remainder 
term. A more natural route to this information will come from the integral calculus. 


Theorem 2 /f the function f is continuous on the closed interval with end-points 
xq and x along with its first n derivatives, and it has a derivative of order n + 1 
at the interior points of this interval, then for any function that is continuous on 
this closed interval and has a nonzero derivative at its interior points, there exists a 
point & between xo and x such that 


ie) = PO) PRD) pone D (g(x —&)". (5.52) 
gl (é)n! 


Proof On the closed interval J with endpoints x9 and x we consider the auxiliary 
function 


F(t) = f(x) — Pat; x) (5.53) 


of the argument t. We now write out the definition of the function F(t) in more 
detail: 


’ () (t 


Fo=s)-[r0+ Po He 


We see from the definition of the function F(t) and the hypotheses of the theorem 
that F is continuous on the closed interval J and differentiable at its interior points, 
and that 


F'(tj)= ro ie + a (x —f) Au (x —t)+ 
m (n+1) (n+1) 
ae f (t) (x t) ioe ae Fi (t) (x a | = f (t) (x t)”. 
2! n!} n!} 


Applying Cauchy’s theorem to the pair of functions F(t), g(t) on the closed 
interval J (see relation (5.48)), we find a point € between xo and x at which 


F(x) ~— FQo) _ F'(§) 
g(x) — p(x) gE) 


Substituting the expression for F’(€) here and observing from comparison of 
formulas (5.50), (5.53) and (5.54) that F(x) — F(xo) = 0 — F(xo0) = —rn (x0; x), 
we obtain formula (5.52). 
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Setting g(t) = x —t¢ in (5.52), we obtain the following corollary. 


Corollary 1 (Cauchy’s formula for the remainder term) 


1 
Ta (X0; X) = fe ee —&)"(x — x9). (5.55) 


A particularly elegant formula results if we set g(t) = (x — t)"+! in (5.52): 


Corollary 2 (The Lagrange form of the remainder) 


In(03 2) = hee apy (5.56) 


We remark that when xo = 0 Taylor’s formula (5.51) is often called MacLaurin’s 
formula.” 
Let us consider some examples. 


Example 3 For the function f(x) = e* with x9 = 0 Taylor’s formula has the form 


1 1 1 
e* aida al Se aa <x! "+ rn (0; x), (5.57) 


and by (5.56) we can assume that 


1 
rn (O; x) = ———e> - "#1 
(n+ 1)! 
where |&| < |x|. 
Thus 
1 [xiert 

0; x)| = ——~e? - |x|"*? a 5.58 
[rn (0; x)| @+D!° |x| be are (5.58) 
But for each fixed x € R, if n — oo, the quantity et. as we know (see Exam- 


ple 12 of Sect. 3.1.3), tends to zero. Hence it follows from the estimate (5.58) and 
the definition of the sum of a series that 


1 1 L 
e Sl+ox+ ae +- oe Mee. (5.59) 


forallx ER. 


!2C. MacLaurin (1698-1746) — British mathematician. 
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Example 4 We obtain the expansion of the function a* for any a,0 <a,aF1l, 
similarly: 


Example 5 Let f(x) = sinx. We know (see Example 18 of Sect. 5.2.6) that 
f(x) = sin(x + zn), n EN, and so by Lagrange’s formula (5.56) with x9 = 0 
and any x € R we find 


1 : ad n+1 
rn (0; x)= aain(e+ Fen) 7 (5.60) 


from which it follows that r,,(0; x) tends to zero for any x € R as n > oo. Thus we 
have the expansion 


sinx =x La ie seep Ey grr Ns ac (5.61) 
3 SI Qn4+ 1! 


for every x ER. 


Example 6 Similarly, for the function f(x) = cos x, we obtain 


1 a 
rn (0; x) = Gab! cos(é + aM + ye (5.62) 
and 
_ Loy 14 (=1)" 9, 
cosx = a ae veep Oni +.-- (5.63) 
forx ER. 


Example 7 Since sinh’ x = coshx and cosh’ x = sinhx, formula (5.56) yields the 
following expression for the remainder in the Taylor series of f(x) = sinhx: 


1 
(n+1)! 


rn(0; x) = fever, 


where f+) (€) = sinhé ifn is even and f+! (€) = coshé if n is odd. In any case 
| f+ (E)| < max{| sinhx|,|coshx|}, since |£| < |x|. Hence for any given value 
x € R we have r,(0, x) > 0 as n — oo, and we obtain the expansion 


1 1 1 
sahara fb Sa eee Pied gees (5.64) 


ar SI Qn+D!” 


valid for all x € R. 
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Example 8 Similarly we obtain the expansion 


1 1 
eoshx a1 4 a? 4 gp oe fo Men, (5.65) 


2! 4! (2n)! 


valid for any x € R. 


Example 9 For the function f(x) = In(1 + x) we have f(x) = oe so 
that the Taylor series of this function at xp = 0 is 
1 1 —| n-1 
Ind +x)=x ule al meee ) x" +r, (0; x). (5.66) 
n 
This time we represent 7, (0; x) using Cauchy’s formula (5.55): 
1 (-1)"n! 
rn (0; x) = nl +e)" (x — é)"x, 
or 
x —é n 
0; x) = (—1)"x( ——}, 5.67 
rn (0; x) = ( (T=) (5.67) 
where & lies between 0 and x. 
If |x| < 1, it follows from the condition that € lies between 0 and x that 
= = 2s 1- _ 
Ne 0 lal 
1+é J1+&| i= {5 | 1— |g 1—|0| 
Thus for |x| < 1 
nO; x)| < lal", (5.69) 
and consequently the following expansion is valid for |x| < 1: 
1 1 —1)""! 
In(l +x) =x — =x? + =x3 4! ) Paes. 2 (5.70) 


2, 3 n 


We remark that outside the closed interval |x| < 1 the series on the right-hand 
side of (5.70) diverges at every point, since its general term does not tend to zero if 
|x| > 1. 


Example 10 For the function (1+.x)“, where w € R, we have f(x) =a(a—1)--- 
(a —n+1)(1+-~x)*~”, so that Taylor’s formula at x9 = 0 for this function has the 
form 


—1 
(inte. Oe ek 
1! 2! 
— safe —n ed 
4 H@= DOMED ee, gy hes 


n! 
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Using Cauchy’s formula (5.55), we find 


(a — YAM) yy gyaenley _ gyty, (5.72) 


a 
rn (0; x) = 


where & lies between 0 and x. 
If |x| < 1, then, using the estimate (5.68), we have 


a(1-2)...(1-2)ia petit (5.73) 
1 n 


When n is increased by 1, the right-hand side of Eq. (5.73) is multiplied by |(1 — 
n+l: But since |x| < 1, we shall have |(1 — asl <q <1, independently of 
the value of aw, provided |x| < g < 1 and n is sufficiently large. 

It follows from this that r,(0;x) — 0 as n > oo for any a € R and any x in 
the open interval |x| < 1. Therefore the expansion obtained by Newton (Newton’s 


binomial theorem) is valid on the open interval |x| < 1: 


|rn (0; x)| < 


a 


a 


ap OOD eg 


a _ 
(1+x)*=1+ a a 


beg 
(5.74) 

We remark that d’ Alembert’s test (see Paragraph b of Sect. 3.1.4) implies that for 
|x| > 1 the series (5.74) generally diverges if a ¢ N. Let us now consider separately 
the case whena =n EN. 

In this case f(x) = (1+x)* = (14+ x)” is a polynomial of degree n and hence all 
of its derivatives of order higher than n are equal to 0. Therefore Taylor’s formula, 
together with, for example, the Lagrange form of the remainder, enables us to write 
the following equality: 

n n(n— 1 n(n—1)---1 
ipa oa (5.75) 
1! 2! n! 
which is the Newton binomial theorem known from high school for a natural- 
number exponent: 


dtayta14(T)x+(S)eee ("Ya 


Thus we have defined Taylor’s formula (5.51) and obtained the forms (5.52), 
(5.55), and (5.56) for the remainder term in the formula. We have obtained the rela- 
tions (5.58), (5.60), (5.62), (5.69), and (5.73), which enable us to estimate the error 
in computing the important elementary functions using Taylor’s formula. Finally, 
we have obtained the power-series expansions of these functions. 


Definition 6 If the function f(x) has derivatives of all orders n € N at a point xo, 
the series 


! 1 
f (x0) + pl Coe —xo)t---+ GF eorer — x0)" cer 


is called the Taylor series of f at the point xo. 
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It should not be thought that the Taylor series of an infinitely differen- 
tiable function converges in some neighborhood of xo, for given any sequence 
C0, C1,---,Cn,--. Of numbers, one can construct (although this is not simple to do) 
a function f(x) such that f™ (x9) = cn, n EN. 

It should also not be thought that if the Taylor series converges, it necessarily 
converges to the function that generated it. A Taylor series converges to the function 
that generated it only when the generating function belongs to the class of so-called 
analytic functions. 

Here is Cauchy’s example of a nonanalytic function: 


—1/x? I 
(= et sion 
0, ifx =0. 

Starting from the definition of the derivative and the fact that x*e~!/ x” _. Oas 
x —> 0, independently of the value of k (see Example 30 in Sect. 3.2), one can verify 
that f™) = 0 forn = 0,1,2,.... Thus, the Taylor series in this case has all its 
terms equal to 0 and hence its sum is identically equal to 0, while f(x) 4 Oif x £0. 

In conclusion, we discuss a local version of Taylor’s formula. 

We return once again to the problem of the local representation of a function 
f : E > R by a polynomial, which we began to discuss in Sect. 5.1.3. We wish to 
choose the polynomial P, (x9; x) = x9 + c1(x — x0) +--+ + n(x — x0)” So as to 
have 


f (x) = Pa(x) +0((x — x0)") asx > x0,x € E, 
or, in more detail, 


f(x) =coter(x — x9) +++ + en (x — x0)" + o((x — x0)") asx—>x0,x EE. 
(5.76) 
We now state explicitly a proposition that has already been proved in all its es- 
sentials. 


Proposition 3 [f there exists a polynomial Py(xo; x) = co + c1(x — x0) Hee + 
Cn(x — x9)" satisfying condition (5.76), that polynomial is unique. 


Proof Indeed, from relation (5.76) we obtain the coefficients of the polynomial suc- 
cessively and completely unambiguously 


co= _lim f(s), 


E>x—>xo 
: f(x) — co 
c= lim — —_, 


E>x>x) X—X0O 


f(x) = [eo Fe en—1& = x0)" 1] 
Ch = _ lim : 
Edx—>xo (x — xo)” 


We now prove the local version of Taylor’s theorem. 
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Proposition 4 (The local Taylor formula) Let E be a closed interval having xp € R 


as an endpoint. If the function f : E — R has derivatives f'(xo),..., f (xo) up 
to order n inclusive at the point xo, then the following representation holds: 
fo) fo) 
f(x) = fo) + 1 1g) Poet ig) 
+ 0((x — x0)") asx —>x9, x EE. (5.77) 


Thus the problem of the local approximation of a differentiable function is solved 
by the Taylor polynomial of the appropriate order. 

Since the Taylor polynomial P,(xo; x) is constructed from the requirement 
that its derivatives up to order n inclusive must coincide with the corresponding 
derivatives of the function f at xo, it follows that f ) (x9) - P& (x0; Xo) = 0 
(k =0,1,...,m) and the validity of formula (5.77) is established by the following 
lemma. 


Lemma 2 /f a function g : E > R, defined on a closed interval E with endpoint 
xq, is such that it has derivatives up to order n inclusive at x9 and 9(xo) = 9' (x0) = 
+» = 9 (xq) = 0, then g(x) = o((x — xo)") as x > x0, x € E. 


Proof For n = 1 the assertion follows from the definition of differentiability of the 
function ¢ at xo, by virtue of which 


p(x) = 9(x0) + y' (x0) (x — x0) FO(X— x0) asx > x0, x EE, 
and, since y(xo) = y’(xo) = 0, we have 
p(x) =o(x— x0) asx—>xo, XE E. 


Suppose the assertion has been proved for order n = k — 1 > 1. We shall show 
that it is then valid for order n = k > 2. 
We make the preliminary remark that since 


(k-1) (k—1) 

_ : Q (x) -—@ (xo) 

e™ (xo) = (*~)' (xo) = _ lim 
E>x—>x0 xX — XO 


the existence of ge (x9) presumes that the function g*-D(x) is defined on E, at 
least near the point x9. Shrinking the closed interval EF if necessary, we can assume 
from the outset that the functions g(x), y’(x),..., gk-Dex), where k > 2, are all 
defined on the whole closed interval E with endpoint xo. Since k > 2, the function 
y(x) has a derivative g’(x) on E, and by hypothesis 


k-1 
(y')'@o) =--- =e) @0) = 0. 
Therefore, by the induction assumption, 


g(x) =o((x — xo!) asx > x9,x EE. 
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Then, using Lagrange’s theorem, we obtain 


g(x) = g(x) — o(x0) = g' (E(x — x0) = (ENE — x0) * Pw — x0), 


where & lies between xo and x, that is, |E — xo| < |x — xo|, and a(&) ~ OasE > 
xo, € € E. Hence as x > xo, x € E, we have simultaneously € > xo, € € E, and 
a(é) > 0. Since 


|o(x)| < Jaé)||x — xol* 7! |x — xol, 


we have verified that 


p(x) =0((x — xo)*) asx —>x9,x € E. 


Thus, the assertion of Lemma 2 has been verified by mathematical induction. 


Relation (5.77) is called the local Taylor formula since the form of the remainder 
term given in it (the so-called Peano form) 


Tn (X03 x) = 0((x — x0)"), (5.78) 


makes it possible to draw inferences only about the asymptotic nature of the con- 
nection between the Taylor polynomial and the function as x > xo, x € E. 
Formula (5.77) is therefore convenient in computing limits and describing the 
asymptotic behavior of a function as x > xo, x € E, but it cannot help with the 
approximate computation of the values of the function until some actual estimate of 
the quantity r;, (x0; x) = o((x — xo)”) is available. 
Let us now summarize our results. We have defined the Taylor polynomial 


(n) 
Py (03x) = fag) + OO (w — xg) + pe 0) — xo)”, 
written the Taylor formula 
os f© (xo) 


f(x) = fo) + (x — X0) + + =] — (& = 20)" + X05), 


and obtained the following very important specific form of it: 


If f has a derivative of order n + 1 on the open interval with endpoints xo and x, 
then 


= me) 


f(x) = f (x0) + Go aq) ret — oe) 


por gh 
map!" — x0) ; 


(5.79) 


where & is a point between xo and x. 
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If f has derivatives of orders up to n = 1 inclusive at the point xo, then 


(n) 
FOO) x9) + ae 0) — xo)" +0((x —x0)"). (5.80) 


f(x) = fo) + 


Relation (5.79), called Taylor’ s formula with the Lagrange form of the remainder, 
is obviously a generalization of Lagrange’s mean-value theorem, to which it reduces 
when n = 0. 

Relation (5.80), called Taylor’s formula with the Peano form of the remainder, 
is obviously a generalization of the definition of differentiability of a function at a 
point, to which it reduces when n = 1. 

We remark that formula (5.79) is nearly always the more substantive of the two. 
For, on the one hand, as we have seen, it enables us to estimate the absolute mag- 
nitude of the remainder term. On the other hand, when, for example, f +I) (x) is 
bounded in a neighborhood of xo, it also implies the asymptotic formula 


(n) 
£00) = f00) + LOO a — 29) +--+ OO (a — 2)" + O(@ —x0)"*"). 


(5.81) 
Thus for infinitely differentiable functions, with which classical analysis deals in the 
overwhelming majority of cases, formula (5.79) contains the local formula (5.80). 
In particular, on the basis of (5.81) and Examples 3-10 just studied, we can now 
write the following table of asymptotic formulas as x —> 0: 


eee ot x" + O(n"), 


cosy a1 — ox? + ext aot CY 9 4 OG?) 
2 (2n)! 
tau 1 3 1 5 (-1)” 2n+1 n+3 
sinx =x ai" + 3% aati Qn+ Di” + O(x ), 
: J 2n 2n+2 
cosh = 1+ 5 27 +a" betes Gas ROG), 
eee Par Abe ga 1 gent EO): 
3! 5! (2n + 1)! 
1 1 —~1)" 
Indl+x)=x ax +3" seals! ) x" + O(x"*), 
n 


(+x)*=14+ 5x 2p EO pe ) “ n+ Oe 
. Nn: 


+ O(x"*), 
Let us now consider a few more examples of the use of Taylor’s formula. 


Example 11 We shall write a polynomial that makes it possible to compute the val- 
ues of sinx on the interval —1 < x < 1 with absolute error at most 10-3. 
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One can take this polynomial to be a Taylor polynomial of suitable degree ob- 
tained from the expansion of sin x in a neighborhood of x9 = 0. Since 


ie a (—1)" 


31 + 5% a Ona TOw TY + ron42(0: x), 


where by Lagrange’s formula 


sin + 5 (2n + 3)) ant3 
(2n +3)! 


r2n+2(0; x)= 
we have, for |x| < 1, 


rn+2(0; x)| = = 
| 2n+2( | (2n +3)! 

but TET < 10~? for n > 2. Thus the approximation sin x * x — tr + ax has the 
required precision on the closed interval |x| < 1. 


Example 12 We shall show that tanx = x + 5x + 0(x*) as x > 0. We have 


tan’ x = cos’ x, 


3 


tan” x = 2cos ° x sinx, 


4 2 


tan” x = 6cos~* x sin? x + 2cos~ x. 

Thus, tan0 = 0, tan’ 0 = 1, tan” 0 = 0, tan’” 0 = 2, and the relation now follows 
from the local Taylor formula. 
Example 13 Let a > 0. Let us study the convergence of the series }°)—; Incos <7. 
For a > 0 we have + — 0 as n-— oo. Let us estimate the order of a term of the 


series: 
1 1 1 1 1 1 1 
Incos a =In{[ 1 ry : “a +o 2a — 5 : ou +0 waa |" 


Thus we have a series of terms of constant sign whose terms are equivalent to 
those of the series )7 "| 3,20 Since this last series converges only for a > 5, when 


a > 0 the original series converges only for a > 5 (see Problem 15b) below). 


Example 14 Let us show that Incosx = 5x? 5x4 ax? + O(x®) asx > 0. 
This time, instead of computing six successive derivatives, we shall use the 


already-known expansions of cosx as x > 0 and In(1 + uw) as u > 0: 
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1 1 1 
incosx = In(1 ae ax’ + (x8) =In(1 +1) = 


2! 4! 
1 
=u su’ + u>+O(u*) = 
1, 14, 166 


Example 15 Let us find the values of the first six derivatives of the function Incos x 
at x = 0. 

We have (Incos)’x = , and it is therefore clear that the function has deriva- 
tives of all orders at 0, since cos 0 # 0. We shall not try to find functional expressions 
for these derivatives, but rather we shall make use of the uniqueness of the Taylor 
polynomial and the result of the preceding example. 

If 


—sinx 


f (x) =co + crx +++ + enx" + 0(x") as x > 0, 
then 


f@) 


i and f(0) =kleg. 


Ck = 


Thus, in the present case we obtain 
1 
(Incos)(0) = 0, (Incos)’(0) = 0, (Incos)” (0) = — os 2!, 


1 
(Incos)°) (0) = 0, (Incos) (0) = — rok Al, 


1 
(Incos)®) (0) = 0, (Incos) (0) = — oh 6!. 


Example 16 Let f(x) be an infinitely differentiable function at the point xo, and 
suppose we know the expansion 


fx) Sep teyx te +x" + O(x"t!) 


of its derivative in a neighborhood of zero. Then, from the uniqueness of the Taylor 
expansion we have 


(f')°@ =e, 
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and so f +) (Q) = ki)... Thus for the function f(x) itself we have the expansion 


1,/ 


co Le} 2, nC, n+l n+2 
IOSIOR aye ee a + O(x"**), 


or, after simplification, 


/ 


co clo C +1 a2) 
FO)=FO + Pat Gar tet Hath + O(n), 


Example 17 Let us find the Taylor expansion of the function f(x) = arctan x at 0. 


Since f’(x) = a =(14x7)-!=1— x7 4x4 —.-- 4+ (-1)" x2" + O(x7"*), 


by the considerations explained in the preceding example, 


1 1 1 (-1)? 
0 3 5 as 2n+1 O 2n+3 : 
EON SSO) 7 ae + Ee +5 * + (x ) 
that is, 
1 : 1 5 ( 1)” 2n+1 2n+3 
t = + veep + O(xT?). 
arctanx =x ae 5* on 7s (x ) 


Example 18 Similarly, by expanding the function arcsin’ x = (1 — x*)~!/? by Tay- 
lor’s formula in a neighborhood of zero, we find successively, 


1 r 1 ) 
(l+uy'? =14+ et 2 i a one 
eA 1 
= = (Gee = n+1 
ip 3(-5 ok 3 ’ "4 O(u"*), 
1 1-3 
—x)7l/2 — -y2 Aeris 4.23 
(—ay Fait ox? + at pt 
creas sia 
eT io) Cia 
Sit 2 ia ed 
arcsinx = x x Pee 
2-3 22.2!-5 
(Qn 1)! 


(2n)!!(2n + 1) 


or, after elementary transformations, 


| aie 2n — 111] 
Sy ae ee ee ] ae [(2n — 1)!!] 


3! 5! "n+ D! 


ont +O io F 


Here (2n — 1)! :=1-3-+-(2n — 1) and (2n)!!:=2-4---(2n). 
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Example 19 We use the results of Examples 5, 12, 17, and 18 and find 


arctan x — sinx . k= 5x? + O(x°)] —[x — ae + O(x°)] 


im = lim 
x>0tanx —arcsinx x>0[x+4 5x3 + O(x5)] —[x+ 4x3 + O(x>)] 


5.3.4 Problems and Exercises 


1. Choose numbers a and b so that the function f(x) = cosx — Lar, is an in- 
finitesimal of highest possible order as x — 0. 

2. Find limy +0 xf — (-4)"1. 

3. Write a Taylor polynomial of e* at zero that makes it possible to compute the 
values of e* on the closed interval —1 <x <2 within 1073. 

4. Let f be a function that is infinitely differentiable at 0. Show that 


a) if f is even, then its Taylor series at 0 contains only even powers of x; 
b) if f is odd, then its Taylor series at 0 contains only odd powers of x. 


5. Show that if f ¢ C© [—1,1] and f™ (0) = 0 for n =0,1,2,..., and there 
exists a number C such that sup_)<,< |f™(x)| <an!C for n €N, then f =0 on 
(=i, 1): 

6. Let f €C™(-1, 1D and sup_,_,-1 | f(x)| < 1. Let me (1) = infyer |f (X)|, 
where / is an interval contained in ]—1, 1[. Show that 


a) if 7 is partitioned into three successive intervals [;, J2, and J3 and yp is the 
length of J, then 


my(1) < —(mg—1(11) + m1); 


ele 


b) if 7 has length 4, then 


Qk(k+1)/2k 


mI) = —y— 
c) there exists a number a, depending only on n such that if | f’(0)| > a, then 
the equation f ")(x) = 0 has at least n — 1 distinct roots in ]—1, 1[. 


Hint: In part b) use part a) and mathematical induction; in c) use a) and prove by 
induction that there exists a sequence x, < Xx, < ++: < Xz, Of points of the open 
interval ]—1, 1[ such that f (xx,)- f (xg,,,) < O for 1<i<k—-1. 

7. Show that if a function f is defined and differentiable on an open interval J and 
[a,b] CT, then 
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a) the function f’(x) (even if it is not continuous!) assumes on [a, b] all the 
values between f’(a) and f’(b) (the theorem of Darboux);'* 

b) if f” (x) also exists in Ja, b[, then there is a point € € Ja, b[ such that f’(b) — 
f(a) = f"(E\(b—a). 


8. A function f(x) may be differentiable on the entire real line, without having a 
continuous derivative f’(x) (see Example 7 in Sect. 5.1.5). 


a) Show that f’(x) can have only discontinuities of second kind. 
b) Find the flaw in the following “proof” that f’(x) is continuous. 


Proof Let xo be an arbitrary point on R and f’(xo) the derivative of f at the point 
Xo. By definition of the derivative and Lagrange’s theorem 


G0) = im LP ZEY = im fe) = Jim f'@), 


x>x0 x— x0 x—>xXQ 


where & is a point between xg and x and therefore tends to x9 as x > Xo. 


9. Let f be twice differentiable on an interval J. Let Mo = sup,<;|f(x)|, Mi = 
supyc, |f/(x)| and M2 = sup, | f’”(x)|. Show that 


a) if J =[—a, a], then 


M, <2./MoM2, if the length of J is not less than 2 ./Mo/M2, 
M, < J/2MoM2, if 1=R; 
c) the numbers 2 and V2 in part b) cannot be replaced by smaller numbers; 
d) if f is differentiable p times on IR and the quantities Mo and M, = 
SUP, eR | f‘)(x)| are finite, then the quantities M; = SUPyeR |f(x)|, l<k <p, 
are also finite and 


M; < Peay 


k/ 
M, P. 
Hint: Use Exercises 6b) and 9b) and mathematical induction. 
10. Show that if a function f has derivatives up to order n + 1 inclusive at a point xo 


and f+ (x9) 40, then in the Lagrange form of the remainder in Taylor’s formula 


1 
inne) = = ae (xo + O(x — x0) (x — x0)", 
where 0 < 6 < | and the quantity 6 = 6(x) tends to at as Xx > Xo. 
11. Let f be a function that is differentiable n times on an interval 7. Prove the 
following statements. 


'3G, Darboux (1842-1917) — French mathematician. 
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a) If f vanishes at (n + 1) points of J, there exists a point € € J such that 


fM(Eé)=0. 


b) If x1, x2,..., xp are points of the interval J, there exists a unique polynomial 
L(x) (the Lagrange interpolation polynomial) of degree at most (n — 1) such that 
f(x) = LQ), i=1,...,n. In addition, for x € 7 there exists a point € € J such 
that 


(x — X1) +++ (% — Xn) 
n! 


f@®. 


c) If x) < x2 <--- < Xp are points of J and n;, | <i < p, are natural numbers 
such that nj +2 +--:+np, =n and f™ (xi) = 0 for 0 <k <n; — 1, then there 
exists a point & in the closed interval |x), x,] at which f?-Y(E) =0. 

d) There exists a unique polynomial H(x) (the Hermite interpolating polyno- 
mial)'* of degree (n — 1) such that f“ (x;) = H“ (x;) for 0 < k <n; — 1. More- 
over, inside the smallest interval containing the points x and xj,i=1,..., p, there 
is a point € such that 


fe) -L@)= 


(x — x1)"! +--+ (x — Xp) 
n! 


Np 
f(x) =H) + f@). 

This formula is called the Hermite interpolation formula. The points x;,i = 
1,..., p, are called interpolation nodes of multiplicity n; respectively. Special cases 
of the Hermite interpolation formula are the Lagrange interpolation formula, which 
is part b) of this exercise, and Taylor’s formula with the Lagrange form of the re- 
mainder, which results when p = 1, that is, for interpolation with a single node of 
multiplicity n. 


12. Show that 


a) between two real roots of a polynomial P(x) with real coefficients there is a 
root of its derivative P’(x); 

b) if the polynomial P(x) has a multiple root, the polynomial P’(x) has the 
same root, but its multiplicity as a root of P’(x) is one less than its multiplicity as a 
root of P(x); 

c) if Q(x) is the greatest common divisor of the polynomials P(x) and P’(x), 
where P’(x) is the derivative of P(x), then the polynomial ah has the roots of 
P(x) as its roots, all of them being roots of multiplicity 1. 


13. Show that 


a) any polynomial P(x) admits a representation in the form co + c1(x — x0) + 
ene Ca(X = XO)" 

b) there exists a unique polynomial of degree n for which f(x) — P(x) = o0((x — 
xo)”) as E > x — xq. Here f is a function defined on a set E and xg is a limit point 
of E. 


'4Ch. Hermite (1822-1901) — French mathematician who studied problems of analysis; in partic- 
ular, he proved that e is transcendental. 
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14. Using induction on k, 1 < k, we define the finite differences of order k of the 
function f at xo: 
A! f (x0; 1) := Af (xo; M1) = fo +41) — fo), 
A? f (x0; hi, h2) = AAF (x0; hi, ha) = 
= (fo +h +h2) — fo +h2)) - 
— (f+ h1) — f G0) = 
= f(xo + hi +ho) — f(xo +h) — fo +h2) + fo), 


AF f (x0; hi, --.5 Ax) = AR! gh (x0; M1, «.- Ak-1), 


where g(x) = A! f(x; hg) = f(x + hx) — f(x). 


a) Let f ¢ C“-)[a, b] and suppose that f(x) exists at least in the open in- 
terval Ja, b[. Show that if all the points xo, x9 +/1,x0 th2, xo thi +ho,...,x0+ 
hi +---+ hy lie in [a, b], then inside the smallest closed interval containing all of 
them there is a point € such that 


A" f (xo; hi, -.-s hn) = f (hr +n. 
b) (Continuation.) If f (xo) exists, then the following estimate holds: 


|A” f (xo: hi, ---sftn) — f oar hn| < 


< sup | fx) — fF (o)| - Wail [aal- 
xéja,bl 


c) (Continuation.) Set A” f (xo; h,..., h) =: A” f (xo; A”). Show that if f™ (x9) 
exists, then 
A fGen") 
(4) — 
i a 


d) Show by example that the preceding limit may exist even when f”) (xo) does 
not exist. 


Hint: Consider, for example, A? FO; h?) for the function 


x3 sin +, x #0, 
x)= Mg 
fx) 0, peut 


and show that 


__ A* f (0; h?) 
lim ———— = 


0. 
h>0 h2 
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1 
xa? 


1 1 1 1 
< 
nite ~a@l\(n—1)*  n& 
holds forn € N anda > 0. 


b) Use the result of a) to show that the series gas 1 + converges foro > 1. 


15. a) Applying Lagrange’s theorem to the function 
the inequality 


where a > 0, show that 


5.4 The Study of Functions Using the Methods of Differential 
Calculus 


5.4.1 Conditions for a Function to be Monotonic 


Proposition 1 The following relations hold between the monotonicity properties of 
a function f : E = R that is differentiable on an open interval |a, b| = E and the 
sign (positivity) of its derivative f' on that interval: 


f'~)>0=> ~~ fisincreasing => f'(x)=0, 
f'(x)=0=5 _ f isnondecreasing => f'(x)=0, 
fix) =0=> f =const. => f'(x) =0, 
f'(x)<0= _ f isnonincreasing => f'(x) <0, 
f'~)<0=> — f isdecreasing => f'(x) <0. 


Proof The left-hand column of implications is already known to us from Lagrange’s 
theorem, by virtue of which f (x2) — f(x1) = f’(€)(x2 — x1), where x1, x2 € Ja, D[ 
and € is a point between x; and x2. It can be seen from this formula that for x1 < x2 
the difference f (x2) — f (x1) is positive if and only if f’(&) is positive. 

The right-hand column of implications can be obtained immediately from the 
definition of the derivative. Let us show, for example, that if a function f that is 
differentiable on Ja, b[ is increasing, then f’(x) > 0 on Ja, b[. Indeed, 


. fath)— fi) 
if = lim —H—W——_. 
f'@) = lim 7 
If h > 0, then f(x +h) — f(x) > 0; and if h <0, then f(x +h) — f(x) < 0. 
Therefore the fraction after the limit sign is positive. 
Consequently, its limit f’(x) is nonnegative, as asserted. 


Remark I It is clear from the example of the function f(x) = x> that a strictly 
increasing function has a nonnegative derivative, not necessarily one that is always 
positive. In this example, f’(0) = 3x7|,—o9 = 0. 
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Remark 2 In the expression A => B, as we noted at the appropriate point, A is a 
sufficient condition for B and B a necessary condition for A. Hence, one can make 
the following inferences from Proposition 1. 

A function is constant on an open interval if and only if its derivative is identically 
zero on that interval. 

A sufficient condition for a function that is differentiable on an open interval to 
be decreasing on that interval is that its derivative be negative at every point of the 
interval. 

A necessary condition for a function that is differentiable on an open interval to 
be nonincreasing on that interval is that its derivative be nonpositive on the interval. 


Example I Let f(x) =x? — 3x +2 on R. Then f’(x) = 3x? — 3 = 3(x? — 1), and 
since f’(x) <0 for |x| < 1 and f’(x) > 0 for |x| > 1, we can say that the function 
is increasing on the open interval ]—oo, —1[, decreasing on ]—1, 1[, and increasing 
again on ]1, +oo[. 


5.4.2 Conditions for an Interior Extremum of a Function 


Taking account of Fermat’s lemma (Lemma | of Sect. 5.3), we can state the follow- 
ing proposition. 


Proposition 2 (Necessary conditions for an interior extremum) In order for a point 
Xo to be an extremum of a function f : U (x9) > R defined on a neighborhood U (xo) 
of that point, a necessary condition is that one of the following two conditions hold: 
either the function is not differentiable at xo or f'(xo) = 0. 


Simple examples show that these necessary conditions are not sufficient. 


Example 2 Let f(x) = x? on R. Then f'(0) = 0, but there is no extremum at 
xo = 0. 


Example 3 Let 


F(x) x forx>0, 
x)= 
2x forx <0. 

This function has a bend at 0 and obviously has neither a derivative nor an ex- 
tremum at 0. 


Example 4 Let us find the maximum of f(x) = x? on the closed interval [—2, 1]. 
It is obvious in this case that the maximum will be attained at the endpoint —2, but 
here is a systematic procedure for finding the maximum. We find f’(x) = 2.x, then 
we find all points of the open interval ]—2, 1[ at which f’(x) = 0. In this case, the 
only such point is x = 0. The maximum of f(x) must be either among the points 
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where f’(x) =0, or at one of the endpoints, about which Proposition 2 is silent. 
Thus we need to compare f(—2) = 4, f(0) =0, and f(1) = 1, from which we 
conclude that the maximal value of f(x) = x” on the closed interval [—2, 1] equals 
4 and is assumed at —2, which is an endpoint of the interval. 


Using the connection established in Sect. 5.4.1 between the sign of the deriva- 
tive and the nature of the monotonicity of the function, we arrive at the following 
sufficient conditions for the presence or absence of a local extremum at a point. 


Proposition 3 (Sufficient conditions for an extremum in terms of the first derivative) 

Let f : U(xo) > R be a function defined on a neighborhood U (xo) of the point xo, 

which is continuous at the point itself and differentiable in a deleted neighborhood 

U (xo). Let U~ (xo) = {x € U(x) | x < xo} and Ut (xo) = {x € U(x0) | x > xo}. 
Then the following conclusions are valid: 


a) (Wx € U~(xo) (f’(x) < 0)) A (Wx € U* (xo) (f'(x) < 0)) > (f has no 
extremum at xq); 

b) (Wx € U~ (x0) (f(x) < 0)) A (Wx € U* (x0) (f'(x) > 0)) > (xo is a Strict 
local minimum of f); 

c) (Wx € U~(x0) (f/(x) > 0)) A (Wx € Ut(x0) (f(x) < 0)) = (x0 is a strict 
local maximum of f); 

d) (Wx € U~ (xo) (f/(x) > 0)) A (Wx € Ut (x0) (f'(x) > 0)) = (f has no 


extremum at xo). 


Briefly, but less precisely, one can say that if the derivative changes sign in pass- 
ing through the point, then the point is an extremum, while if the derivative does not 
change sign, the point is not an extremum. 

We remark immediately, however, that these sufficient conditions are not neces- 
sary for an extremum, as one can verify using the following example. 


Example 5 Let 


2x2+4+x%sin+ for x 40, 
f(x) = 7 
0 for x = 0. 


Since x? < f(x) < 3x2, it is clear that the function has a strict local minimum 
at x9 = 0, but the derivative f’(x) = 4x + 2x sin 1 — cos i is not of constant sign 
in any deleted one-sided neighborhood of this point. This same example shows the 
misunderstandings that can arise in connection with the abbreviated statement of 


Proposition 3 just given. 
We now turn to the proof of Proposition 3. 


Proof a) It follows from Proposition 2 that f is strictly decreasing on U~ (xo). Since 
it is continuous at x9, we have lim, Gayseex f(x) = f(%o), and consequently 


f(x) > f(xo) for x € U~ (xo). By the same considerations we have f (xo) > f(x) 
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for x € Ut (xq). Thus the function is strictly decreasing in the whole neighborhood 
U(xo) and xo is not an extremum. 

b) We conclude to begin with, as in a), that since Mg (x) is decreasing on U- (xo) 
and continuous at x9, we have f(x) > f (xo) for x € U- (xo). We conclude from the 
increasing nature of f on U* (xo) that f(xo) < f(x) forxe€ U* (xo). Thus f has a 
strict local minimum at xo. 

Statements c) and d) are proved similarly. 


Proposition 4 (Sufficient conditions for an extremum in terms of higher-order 
derivatives) Suppose a function f : U(xo) > R defined on a neighborhood U (xo) 
of xo has derivatives of order up to n inclusive at xo (n => 1). 

If f' (xo) =--- = f° (x9) =0 and f™ (xo) £0, then there is no extremum at 
xo ifn is odd. If n is even, the point xg is a local extremum, in fact a strict local 
minimum if f (xo) > 0 and a strict local maximum if f ™ (xo) <0. 


Proof Using the local Taylor formula 


1 
f(x) — f (xo) = Sf ore — x9)" + a(x) (x — x0)", (5.82) 


where a(x) — 0 as x — xo, we shall reason as in the proof of Fermat’s lemma. We 
rewrite Eq. (5.82) as 


1 
f(x) — f (x0) = (70 + x(s)) (x — x0)". (5.83) 


Since f (xo) 4 0 and a(x) > 0 as x > xo, the sum f™) (x9) + a(x) has the 
sign of f (x9) when x is sufficiently close to xo. If n is odd, the factor (x — xo)” 
changes sign when x passes through xo, and then the sign of the right-hand side of 
Eq. (5.83) also changes sign. Consequently, the left-hand side changes sign as well, 
and so for n = 2k + | there is no extremum. 

If n is even, then (x — x0)” > 0 for x 4 xo and hence in some small neighborhood 
of xo the sign of the difference f(x) — f (xo) is the same as the sign of f ™ (x9), as 
is clear from Eq. (5.83). 


Let us now consider some examples. 


Example 6 (The law of refraction in geometric optics (Snell’s law)!°) According to 
Fermat’s principle, the actual trajectory of a light ray between two points is such 
that the ray requires minimum time to pass from one point to the other compared 
with all paths joining the two points. 

It follows from Fermat’s principle and the fact that the shortest path between two 
points is a straight line segment having the points as endpoints that in a homoge- 
neous and isotropic medium (having identical structure at each point and in each 
direction) light propagates in straight lines. 


'SW. Snell (1580-1626) — Dutch astronomer and mathematician. 
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Fig. 5.10 


Now consider two such media, and suppose that light propagates from point A, 
to Az, as shown in Fig. 5.10. 
If c, and cp are the velocities of light in these media, the time required to traverse 


the path is 
1 2 1 2 
t(x)=—/ht + x24 — hz + (a —x)?. 
Cl c2 


We now find the extremum of the function f(x): 


x 1 a-x 


1 
t'(x) = = 0, 
1 neta? [nb + (ax)? 


which in accordance with the notation of the figure, yields a sina = Cy | sin a2. 

It is clear from physical considerations, or directly from the form of the function 
t(x), which increases without bound as x > oo, that the point where t’(x) = 0 is an 
absolute minimum of the continuous function t(x). Thus Fermat’s principle implies 
the law of refraction 2% — 4 


sin a2 C2" 


Example 7 We shall show that for x > 0 


x*-—ax+a—1<0, when0<a <1, (5.84) 


x*-—ax+ta—1>0, whena<Oorl <a. (5.85) 


Proof Differentiating the function f(x) = x* —ax +a — 1, we find f’(x) = 
a(x?! — 1) and f’(x) =0 when x = 1. In passing through the point | the derivative 
passes from positive to negative values if 0 < a < | and from negative to positive 
values if a <0 or a > 1. In the first case the point | is a strict maximum, and in 
the second case a strict minimum (and, as follows from the monotonicity of f on 
the intervals 0 < x < 1 and | < x, not merely a local minimum). But f(1) = 0 and 
hence both inequalities (5.84) and (5.85) are established. In doing so, we have even 
shown that both inequalities are strict if x ~ 1. 


We remark that if x is replaced by | + x, we find that (5.84) and (5.85) are 
extensions of Bernoulli’s inequality (Sect. 2.2; see also Problem 2 below), which 
we already know for a natural-number exponent a. 
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By elementary algebraic transformations one can obtain a number of classical 
inequalities of great importance for analysis from the inequalities just proved. We 
shall now derive these inequalities. 


a. Young’s Inequalities!° 


If a > 0 and b > 0, and the numbers p and q such that p # 0,1, g 40,1 and 
7 + 7 = 1, then 


1 1 

alPpi<—¢@4—b if ps1, (5.86) 
Pq 
1 41 

Gh > eto ofp al; (5.87) 
P 4 


and equality holds in (5.86) and (5.87) only when a = b. 


Proof It suffices to set x = 5 anda = ; in (5.84) and (5.85), and then introduce the 


notation 1 =1— 1. 
q P 


b. Hélder’s Inequalities!’ 


Let x; > 0, yj 20,i=1,...,n,and 5 +2=1. Then 


n n I/p n 1/q 
ea (> ) (> st) for p>, (5.88) 
i=1 i=1 i=1 


and 


n 


n n 1/p 1/q 
> a (s>+") (s>>?] for p <1,p#£0. (5.89) 
i=1 i=1 i=l 


In the case p <0 it is assumed in (5.89) that x1 > 0 G@ =1,...,n). Equality is 
possible in (5.88) and (5.89) only when the vectors Gs er xP) and erie er yf) 
are proportional. 


Proof Let us verify the inequality (5.88). Let X = pare > 0 and Y= 
-P Ve 
YL yf > 0. Setting a= + and b = + in (5.86), we obtain 
MiYi 1 x 1 ye 
X'/pylla— pX ' GY 


!OW.H. Young (1882-1946) — British mathematician. 
170. Hélder (1859-1937) — German mathematician. 
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Summing these inequalities over i from | to n, we obtain 


LiaiXiVi 2 y 


X!/pyl/¢ — 


’ 


which is equivalent to relation (5.88). 

We obtain (5.89) similarly from (5.87). Since equality occurs in (5.86) and (5.87) 
only when a = b, we conclude that it is possible in (5.88) and (5.89) only when a 
proportionality bai = ryt or ye = Ax? holds. 


c. Minkowski’s Inequalities !* 


Let x; => 0, y, > 0,i=1,...,n. Then 


n 1/p n 1/p n 1/p 
(Se + »") < bay + (» ) when p>1, — (5.90) 
i=1 i=1 i=1 


and 


n 1/p n 1/p n 1/p 
(de a »") > (+) + (> ) when p <1, p40. (5.91) 
i=l i=l il 


Proof We apply Hélder’s inequality to the terms on the right-hand side of the iden- 
tity 


n n n 
Yi@i ty)? =o xiGit yi)? + > yiGi ty)? 
i=l i=l i=l 
The left-hand side is then bounded from above (for p > 1) or below (for p < 1) 
in accordance with inequalities (5.88) and (5.89) by the quantity 


n l/pyn I/q n l/pyn 1/q 
(> “) (De + »") + (> sf) (De + »") : 
i=l i=l 


i=l i=l 


After dividing these inequalities by ()~"_, (x; + y;)?)!/7, we arrive at (5.90) and 
(5.91). 

Knowing the conditions for equality in Hélder’s inequalities, we verify that 
equality is possible in Minkowski’s inequalities only when the vectors (x1, ..., Xn) 
and (y1,..-, Yn) are collinear. 


For n = 3 and p = 2, Minkowski’s inequality (5.90) is obviously the triangle 
inequality in three-dimensional Euclidean space. 


'8H. Minkowski (1864-1909) — German mathematician who proposed a mathematical model 
adapted to the special theory of relativity (a space with a sign-indefinite metric). 
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Fig. 5.11 (x2, f(@2)) 
(a1%1 + agx2, 01 f (x1) + a2f(r2)) 


Example § Let us consider another elementary example of the use of higher-order 
derivatives to find local extrema. Let f(x) = sinx. Since f’(x) =cosx and f"(x) = 
—sinx, all the points where f’(x) = cosx = 0 are local extrema of sinx, since 
f(x) = —sinx 4 0 at these points. Here f”(x) < 0 if sinx > 0 and f”(x) > 0 
if sinx <0. Thus the points where cos x = 0 and sinx > 0 are local maxima and 
those where cosx = O and sinx < 0 are local minima for sinx (which, of course, 
was already well-known). 


5.4.3 Conditions for a Function to be Convex 


Definition 1 A function f : Ja, bl — R defined on an open interval Ja, b[ C R is 
convex if the inequalities 


f (ayx1 +.02x2) < a f (x1) + 2 f (x2) (5.92) 


hold for any points x1, x2 € Ja, b[ and any numbers a > 0, a2 > 0 such that a; + 
a2 = |. If this inequality is strict whenever x; 4 x2 and aja2 # 0, the function is 
strictly convex on Ja, b[. 


Geometrically, condition (5.92) for convexity of a function f : Ja, b] ~ R means 
that the points of any arc of the graph of the function lie below the chord subtended 
by the arc (see Fig. 5.11). 

In fact, the left-hand side of (5.92) contains the value f(x) of the function at 
the point x = ax, + a@2x2 € [x1, x2] and the right-hand side contains the value at 
the same point of the linear function whose (straight-line) graph passes through the 
points (x1, f (x1) and (x2, f(x2)). 

Relation (5.92) means that the set E = {(x, y) € R? |x € Ja, b[, f(x) < y} of the 
points of the plane lying above the graph of the function is convex; hence the term 
“convex’’, as applied to the function itself. 


Definition 2 If the opposite inequality holds for a function f : Ja, bl > R, that 
function is said to be concave on the interval Ja, b[, or, more often, convex upward 
in the interval, as opposed to a convex function, which is then said to be convex 
downward on Ja, b{. 
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Since all our subsequent constructions are carried out in the same way for a 
function that is convex downward or convex upward, we shall limit ourselves to 
functions that are convex downward. 

We first give a new form to the inequality (5.92), better adapted for our purposes. 

In the relations x = a,x; + a@2Xx2, a1 + a2 = 1, we have 

xX2—x X— Xj 


a= ? 
X12 — X] 


a), = ? 
X2—X1 


so that (5.92) can be rewritten as 


x 


fQ)< 


x2 
x2 


—x x—X 
fei f (x2). 

XxX] X2—X] 

Taking account of the inequalities x1 <x <x and x; < x2, we multiply by x2 — x, 

and obtain 


(x2 — x) f (x1) + 1 — x2) fe) + @& — x1) f (x2) 2 O. 


Remarking that x2 — x; = (x2 — x) + (x — x1) we obtain from the last inequality, 
after elementary transformations, 


Fx) = FOr) — fG2) — £O) 


xXx—X| x2—X 


(5.93) 


for x1 <x < x2 and any x1, x2 € Ja, D[. 

Inequality (5.93) is another way of writing the definition of convexity of the 
function f (x) on an open interval Ja, b[. Geometrically, (5.93) means (see Fig. 5.11) 
that the slope of the chord I joining (x1, f(x1)) to (x, f(x)) is not larger than (and 
in the case of strict convexity is less than) the slope of the chord II joining (x, f(x)) 
to (x2, f(x2)). 

Now let us assume that the function f : Ja, b[ > R is differentiable on Ja, D[. 
Then, letting x in (5.93) tend first to x;, then to x2, we obtain 


f (x2) — fr) 


X2—X1 


fans = f Go); 


which establishes that the derivative of f is monotonic. 
Taking this fact into account, for a strictly convex function we find, using La- 
grange’s theorem, that 


Ie) IA), Se Fe) 


xXx— xX x2 —X 


food<f'G@)= = f' (2) < f’(2) 


for x1 < &| <x < & < x2, that is, strict convexity implies that the derivative is 
strictly monotonic. 

Thus, if a differentiable function f is convex on an open interval Ja, b[, then f’ 
is nondecreasing on Ja, b[; and in the case when f is strictly convex, its derivative 
f’ is increasing on Ja, D[. 
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These conditions turn out to be not only necessary, but also sufficient for convex- 
ity of a differentiable function. 
In fact, for a <x) <x < x2 <b, by Lagrange’s theorem 


f@) — fF) rey, f (x2) — Ff) 


xX—X ~ x2—X ~ 


f'(&), 


where x; < & <x < & < x9; and if f’(&) < f’(&), then condition (5.93) for 
convexity holds (with strict convexity if f’(&,) < f’(&2)). 
We have thus proved the following proposition. 


Proposition 5 A necessary and sufficient condition for a function f :]a,b[ > R 
that is differentiable on the open interval Ja, b{ to be convex (downward) on that 
interval is that its derivative f’ be nondecreasing on Ja, b[. A strictly increasing f' 
corresponds to a Strictly convex function. 


Comparing Proposition 5 with Proposition 3, we obtain the following corollary. 


Corollary A necessary and sufficient condition for a function f : ]a, b| > R having 
a second derivative on the open interval |a, b{ to be convex (downward) on Ja, b{ is 
that f(x) => 0 on that interval. The condition f"(x) > 0 on Ja, b[ is sufficient to 
guarantee that f is strictly convex. 


We are now in a position to explain, for example, why the graphs of the simplest 
elementary functions are drawn with one form of convexity or another. 


Example 9 Let us study the convexity of f (x) = x® on the set x > 0. Since f’ (x) = 
a(a — 1)x*~?, we have f(x) > 0 for a <0 or a > 1, that is, for these values of 
the exponent @ the power function x® is strictly convex (downward). For 0 <a < | 
we have f”(x) <0, so that for these exponents it is strictly convex upward. For 
example, we always draw the parabola f(x) = x? as convex downward. The other 
cases a = 0 and @ = | are trivial: x = 1 and x! = x. In both of these cases the 
graph of the function is a ray (see Fig. 5.18 on p. 251). 


Example 10 Let f(x) =a*, 0 <a,a#1. Since f(x) =a* In*a > 0, the expo- 
nential function a” is strictly convex (downward) on R for any allowable value of 
the base a (see Fig. 5.12). 

Example 11 For the function f(x) = log, x we have f(x) = -s, so that the 
function is strictly convex (downward) if 0 < a < 1, and strictly convex upward if 
1 <a (see Fig. 5.13). 


Example 12 Let us study the convexity of f(x) = sin x (see Fig. 5.14). 
Since f”(x) = —sinx, we have f”(x) <0 on the intervals m - 2k < x < 
mw(2k + 1) and f”(x) > 0 on 2(2k — 1) < x < m - 2k, where k € Z. It follows 
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Fig. 5.12 
Fig. 5.13 7] 
a>0oO 
0 ‘ x 
O<a<l 
y = log, x 
Fig. 5.14 


from this, for example, that the arc of the graph of sinx on the closed interval 
0 <x < 4 lies above the chord it subtends everywhere except at the endpoints; 


therefore sinx > 2x forO <x < z- 


We now point out another characteristic of a convex function, geometrically 
equivalent to the statement that a convex region of the plane lies entirely on one 
side of a tangent line to its boundary. 


Proposition 6 A function f :]a, b[ > R that is differentiable on the open interval 
Ja, b[ is convex (downward) on Ja, b[ if and only if its graph contains no points 
below any tangent drawn to it. In that case, a necessary and sufficient condition for 
strict convexity is that all points of the graph except the point of tangency lie strictly 
above the tangent line. 


Proof Necessity. Let xo € Ja, b[. The equation of the tangent line to the graph at 
(xo, f (xo)) has the form 


y= f (xo) + f' (x0) — x0), 


so that 


f(®) — y@) = f(x) — f 0) — f’ ro) @ — x0) = (FE) — f' 0) — x0), 
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where & is a point between x and xo. Since f is convex, the function f’(x) is 
nondecreasing on Ja, b[ and so the sign of the difference f(€) — f’(xg) is the same 
as the sign of the difference x — xg. Therefore f(x) — y(x) > 0 at each point x € 
Ja, b[. If f is strictly convex, then f’ is strictly increasing on Ja, b[ and so f(x) — 
y(x) > 0 for x € Ja, b[ and x # xo. 

Sufficiency. If the inequality 


f(x) — y@) = fx) — fo) — fo) — x0) = 0 (5.94) 
holds for any points x, xo € Ja, b[, then 
L@) — Fo) < f'(xo) forx < x09, 
X— XQ 
LODE IONS Hes Foray Sa. 
x — x0 


Thus, for any triple of points x1, x, x2 € Ja, b[ such that x, < x < x2 we obtain 


FO) = FO) — £2) = £O) 


X—X] 7 x2—X 


and strict inequality in (5.94) implies strict inequality in this last relation, which, as 
we see, is the same as the definition (5.93) for convexity of a function. 


Let us now consider some examples. 


Example 13 The function f (x) = e* is strictly convex. The straight line y = x + 1 
is tangent to the graph of this function at (0, 1), since f(0) = e? = 1 and f'O= 
e*|,=9 = 1. By Proposition 6 we conclude that for any x € R 


e*>1+4+x, 
and this inequality is strict for x 4 0. 


Example 14 Similarly, using the strict upward convexity of In x, one can verify that 
the inequality 


Inx<x-1 


holds for x > 0, the inequality being strict for x # 1. 


In constructing the graphs of functions, it is useful to distinguish the points of 
inflection of a graph. 


Definition 3 Let f : U(xo) — R be a function defined and differentiable on a 
neighborhood U (xo) of xo € R. If the function is convex downward (resp. upward) 
on the set U ~ (xo) = {x € U(xo) | x < xo} and convex upward (resp. downward) on 
U* (xo) = {x € U(xo) | x > xo}, then (x0, f (xo)) is called a point of inflection of 
the graph. 
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Thus when we pass through a point of inflection, the direction of convexity of 
the graph changes. This means, in particular, that at the point (xo, f(xo)) the graph 
of the function passes from one side of the tangent line to the other. 

An analytic criterion for the abscissa x of a point of inflection is easy to surmise, 
if we compare Proposition 5 with Proposition 3. To be specific, one can say that if f 
is twice differentiable at xg, then since f’(x) has either a maximum or a minimum 
at x9, we must have f”’(xo) = 0. 

Now if the second derivative f”(x) is defined on U(xq) and has one sign every- 
where on U~ (xo) and the opposite sign everywhere on Ut (xg), this is sufficient for 
f'(x) to be monotonic in U- (xg) and monotonic in U + (xo) but with the opposite 
monotonicity. By Proposition 5, a change in the direction of convexity occurs at 
(xo, f (xo)), and so that point is a point of inflection. 


Example 15 When considering the function f(x) = sinx in Example 12 we found 
the regions of convexity and concavity for its graph. We shall now show that the 
points of the graph with abscissas x = zk, k € Z, are points of inflection. 

Indeed, f" (x) = —sinx, so that f”(x) = 0 at x =ak, k € Z. Moreover, f” (x) 
changes sign as we pass through these points, which is a sufficient condition for a 
point of inflection (see Fig. 5.14 on p. 244). 


Example 16 It should not be thought that the passing of a curve from one side of 

its tangent line to the other at a point is a sufficient condition for the point to be a 

point of inflection. It may, after all, happen that the curve does not have any constant 

convexity on either a left- or a right-hand neighborhood of the point. An example is 

easy to construct, by improving Example 5, which was given for just this purpose. 
Let 

2x3+x3sin- forx £0, 


ro= {5 for x = 0. 


Then x3 < f(x) < 3x? for 0 < x and 3x? < f(x) < x? for x <0, so that the 
graph of this function is tangent to the x-axis at x = 0 and passes from the lower 
half-plane to the upper at that point. At the same time, the derivative of f(x) 

fae 6x? + 3x? sin 3 — 2cos + for x £0, 
0 for x =0 


is not monotonic in any one-sided neighborhood of x = 0. 


In conclusion, we return again to the definition (5.92) of a convex function and 
prove the following proposition. 


Proposition 7 (Jensen’s inequality)!? If f : Ja,b[ > R is a convex function, 
X1,.++,Xp, are points of ja, b[, and a1,...,, are nonnegative numbers such that 


197.L. Jensen (1859-1925) — Danish mathematician. 
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a, +---+a, = 1, then 
F(@1x] +++ + OnXn) SO f (41) +°++ + On f On). (3,95) 
Proof For n = 2, condition (5.95) is the same as the definition (5.92) of a convex 
function. 
We shall now show that if (5.95) is valid for n = m — 1, it is also valid for n = m. 
For the sake of definiteness, assume that a, ~ 0 in the set a1,...,a@,. Then B = 


a2 +---+a, > 0 and 3 fiero a = |. Using the convexity of the function, we 
find 


a2 On 
f(x +++ + OnXn) = (ain +6(2x ess t “*.xn)) < 


<a fon)+ Bf (Gxt +H ‘»), 


since a + B = | and (F 7X] +: + Bxn) € Ja, D[. 
By the induction af ohiesiR we any have 


(2s tet ay) © 2 pen) to ase se 
B is ie ie am 


Consequently 


fax+- + cnn) Sa fer) + Bf (Ban + +H wn) s 
<0 f (x1) + anf (42) 1 On Fn). 


By induction we now conclude that (5.95) holds for any n € N. (For n = 1, relation 
(5.95) is trivial.) 


We remark that, as the proof shows, a strict Jensen’s inequality corresponds to 
strict convexity, that is, if the numbers @1,...,@, are nonzero, then equality holds 
in (5.95) if and only if xj =---=Xxy. 

For a function that is convex upward, of course, the opposite relation to inequality 
(5.95) is obtained: 


S(ayxy +++) + Oy Xn) = ay f (x1) +++ + On f (Xn). (5.96) 


Example 17 The function f(x) = Inx is strictly convex upward on the set of posi- 
tive numbers, and so by (5.96) 
ay inxy +---+ay,Inx, < In(ayxy +--+ + anXn) 


or, 
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Xp x8" < ax tes + Onn (5.97) 
forza; > 0,0; 20,t=1,...;7,and S770; = 1. 
In particular, if a} =---=a, = i, we obtain the classical inequality 
x + mae + x 
A (5.98) 
n 


between the geometric and arithmetic means of n nonnegative numbers. Equality 
holds in (5.98), as noted above, only when x; = x2 =--- = xX,. If we set n = 2, 
ay= >, a2 = a x1 =a, X27 = b in (5.97), we again obtain the known equality 
(5.86). 


Example 18 Let f(x) =x?,x => 0, p > 1. Since such a function is convex, we have 
n Pp n 
(Sea) < Soon 
i=l i=l 


Setting g = = - aj = be iB) ! and x; =ajb oe }) be 4187 here, we ob- 
tain Hoélder’s inequality (5.88): 


n n Y/p/n 1/q 
S0-(B) "Es" 


i=l i=] i=l 


where + + 7 = Land p> 1. 
For p < 1 the function f(x) = x? is convex upward, and so analogous reasoning 
can be carried out in Hélder’s other inequality (5.89). 


5.4.4 L’Hopital’s Rule 


We now pause to discuss a special, but very useful device for finding the limit of a 
ratio of functions, known as |’ H6pital’s rule. 


Proposition 8 (l’H6pital’s rule) Suppose the functions f : ]a,b| > R and g: 
Ja, b| > R are differentiable on the open interval a, b[ (—oo < a < b < +00) with 
g’(x) £0 0n Ja, b[ and 

f'(x) 

g(x) 


>A asxr>a+0 (-wK<A<+00). 


20G.F. de 1’ H6pital’s (1661-1704) — French mathematician, a capable student of Johann Bernoulli, 
a marquis for whom the latter wrote the first textbook of analysis in the years 1691-1692. The 
portion of this textbook devoted to differential calculus was published in slightly altered form by 
1’ H6pital’s under his own name. Thus “l’ H6pital’s rule” is really due to Johann Bernoulli. 
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Then 
f (x) 
g(x) 


in each of the following two cases: 


19 (f(x) > 0) A (g(x) > 0) asx > a +0, 


>A asx>a+0 


or 
oe g(x) ~ wasx—>a+0. 

A similar assertion holds as x > b —0. 

L’ H6pital’s rule can be stated succinctly, but not quite accurately, as follows. The 


limit of a ratio of functions equals the limit of the ratio of their derivatives if the 
latter exists. 


Proof If g(x) #0, we conclude on the basis of Rolle’s theorem that g(x) is strictly 
monotonic on Ja, b[. Hence, shrinking the interval Ja, b[ if necessary by shifting 
toward the endpoint a, we can assume that g(x) 4 0 on Ja, b[. By Cauchy’s theorem, 
for x, y € Ja, b[ there exists a point & € Ja, b[ such that 

f@-fO)_ f'®) 

g(x)—g(y) gE) 


Let us rewrite this equality in a form convenient for us at this point: 


f@) _ fO) it f'@) (: i) 
g(x) g(x) a’) ey” 


As x — a+ 0, we shall make y tend to a + 0 in such a way that 


fQ) ay) 
—- and > 
g(x) g(x) 


This is obviously possible under each of the two hypotheses 1° and 2° that we are 
considering. Since & lies between x and y, we also have € — a+ 0. Hence the 
right-hand side of the last inequality (and therefore the left-hand side also) tends 
to A. 


Example 19 lim,-0 a 


This example should not be looked on as a new, independent proof of the relation 
sine — las x — 0. The fact is that in deriving the relation sin’ x = cos x we already 
made use of the limit just calculated. 

We always verify the legitimacy of applying l’H6pital’s rule after we find the 
limit of the ratio of the derivatives. In doing so, one must not forget to verify con- 
dition 1° or 2°. The importance of these conditions can be seen in the following 


example. 


= lim,+9 @8* = 1. 
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Example 20 Let f(x) =cosx, g(x) =sinx. Then f’(x) = —sinx, g/(x) =cosx, 


ae — +00 as x — +0, while fs —> 0as x —> +0. 


and 


Example 21 


; di 
fe te 


= =0 fora>0. 
x—>+00 x” x>+00 gx! x>+00 ax® 


Example 22 


: x” ; axe! . a(a—1)---(@a—n+1)x*™ 
lm —= lim - =---= lim =0 
x—>+00 q*  x—>+00 a* Ina I> 100 a*(Ina)" 


xan 


for a > 1, since for n > a and a > 1 it is obvious that —> Oif x > +00. 


a 


We remark that this entire chain of equalities was hypothetical until we arrived 
at an expression whose limit we could find. 


5.4.5 Constructing the Graph of a Function 


A graphical representation is often used to gain a visualizable description of a func- 
tion. As a rule, such a representation is useful in discussing qualitative questions 
about the behavior of the function being studied. 

For precise computations graphs are used more rarely. In this connection what 
is important is not so much a scrupulous reproduction of the function in the form 
of a graph as the construction of a sketch of the graph of the function that correctly 
reflects the main elements of its behavior. In this subsection we shall study some 
general devices that are encountered in constructing a sketch of the graph of a func- 
tion. 


a. Graphs of the Elementary Functions 

We recall first of all what the graphs of the main elementary functions look like. 
A complete mastery of these is needed for what follows (Figs. 5.12—5.18). 

b. Examples of Sketches of Graphs of Functions (Without Application 

of the Differential Calculus) 

Let us now consider some examples in which a sketch of the graph of a function can 


be easily constructed if we know the graphs and properties of the simplest elemen- 
tary functions. 
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Fig. 5.15 


Fig. 5.16 


Fig. 5.17 


Fig. 5.18 


Example 23 Let us construct a sketch of the graph of the function 


h =log,2_ 34-72. 


Taking account of the relation 


1 


251 


Tv 


Y = aLrccos © 


y = log,2_37422= 


logy(x2 —3x +2)  logy(x — D(x — 2)’ 
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Fig. 5.19 “ 


we construct successively the graph of the quadratic trinomial y, = x* — 3x 4+ 2, 
then y2 = log, yi (x), and then y = aw (Fig. 5.19). 

The shape of this graph could have been “guessed” in a different way: by first 
determining the domain of definition of the function log,2_3,472 = (log) (x? — 
3x +2))7!, then finding the behavior of the function under approach to the boundary 
points of the domain of definition and on intervals whose endpoints are the boundary 
points of the domain of definition, and finally drawing a “smooth curve” taking 
account of the behavior thus determined at the ends of the interval. 


Example 24 The construction of a sketch of the graph of the function 
y= sin(x*) 


can be seen in Fig. 5.20. 

We have constructed this graph using certain characteristic points for this func- 
tion, the points where sin(x~) =-l, sin(x~) = 0, or sin(x~) = |. Between two ad- 
jacent points of this type the function is monotonic. The form of the graph near the 
point x = 0, y = 0 is determined by the fact that sin(x”) ~ x? as x > 0. Moreover, 
it is useful to note that this function is even. 

Since we will be speaking only of sketches rather than a precise construction 
of the graph of a function, let us agree for the sake of brevity to understand that 
“constructing the graph of a function” is equivalent to “constructing a sketch of the 
graph of the function”. 


Example 25 Let us construct the graph of the function 


y =x +arctan(x? — 1) 
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Fig. 5.20 


(Fig. 5.21). As x — —oo the graph is well approximated by the line y =x — 3, 
while for x + +00 it is approximated by y = x + 3. 


We now introduce a useful concept. 


Definition 4 The line co + cx is called an asymptote of the graph of the function 
y= f(x) as x > —oo (or x > +00) if f(x) — (Co +.¢1x) = o(1) as x > —oo (or 
x — +00). 


Thus in the present example the graph has the asymptote y = x — 5 as x > —oo 
and y=x+ 4 asx — +00. 

If | f(x)| > cw asx > a—O0 (oras x > a +0) it is clear that the graph of the 
function will move ever closer to the vertical line x = a as x approaches a. We call 
this line a vertical asymptote of the graph, in contrast to the asymptotes introduced 
in Definition 4, which are always oblique. 

Thus, the graph in Example 23 (see Fig. 5.19) has two vertical asymptotes and 
one horizontal asymptote (the same asymptote as x — —oo and as x — +00). 

It obviously follows from Definition 4 that 


. f(x) 
cp= lm — 
ioe come ©, 2 Xx 


co = im (f (x) — c1x). 


In general, if f(x) — (co tc1x +--+ + cnx") = o0(1) as x > —on, then 
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Fig. 5.21 yn 


fo) vias 


x 


3 = arctan y2(x) 


1 
y 


Ty | 
[ses 


wl ala 


y = ¥3(x) + ¥3(z) 


_ F(x) 
Ch= li : 
X00 xl 
f(x) - Cnx" 
Cnh-1= lim ————, 
X—>—00 xn-l 


co= lim (f (x) — (c1x +++ + ¢nx")). 


x—2>—-OO 


These relations, written out here for the case x —> —oo, are of course also valid 
in the case x — +00 and can be used to describe the asymptotic behavior of the 
graph of a function f(x) using the graph of the corresponding algebraic polynomial 


Cp tex tees + cnx". 
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Fig. 5.22 p 


Example 26 Let (p,@) be polar coordinates in the plane and suppose a point is 
moving in the plane in such a way that 


p=p(t)=1—e ‘cos at 


yg=g(t)=1—-e sin 51 


at time f (t > 0). Draw the trajectory of the point. 

In order to do this, we first draw the graphs of p(t) and g(t) (Figs. 5.22a and 
5.22b). 

Then, looking simultaneously at both of the graphs just constructed, we can de- 
scribe the general form of the trajectory of the point (Fig. 5.22c). 


c. The Use of Differential Calculus in Constructing the Graph of a Function 


As we have seen, the graphs of many functions can be drawn in their general features 
without going beyond the most elementary considerations. However, if we want to 
make the sketch more precise, we can use the machinery of differential calculus in 
cases where the derivative of the function being studied is not too complicated. We 
shall illustrate this using examples. 


Example 27 Construct the graph of the function y = f(x) when 
f@)=|x +e. 


The function f(x) is defined for x € R\0. Since e~!/* — 1 as x > 0, it follows 
that 


=1/¢ —(x+2) asx > -o, 


|x + 2]e 
(x + 2) as x > +00. 
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a. b. 
Fig. 5.23 
Table 5.2 Interval ]J-—oo, —2[ ]—2, OL ]0, +c0[ 
Sign of f’(x) - + + 
Behavior of f(x) +oo \, 0 0 7 +00 0A +00 


Next, it is obvious that |x + 2|e~!/* + +00 as x > —0, and |x + 2Je"!/* + 0 
as x — +0. Finally, it is clear that f(x) > 0 and f(—2) = 0. On the basis of these 
observations, one can already make a first draft of the graph (Fig. 5.23a). 

Let us now see for certain whether this function is monotonic on the intervals 
J—oo, —2[, [—2,0[, and ]O, +oo[, whether it really does have these asymptotics, 
and whether the convexity of the graph is correctly shown. 

Since 


Dia 5 - 3 
—ete ls, if x < —2, 


f= 


2 ; 
tees, if —2<xandx 40, 


and f’(x) 40, we can form Table 5.2. 

On the regions of constant sign of the derivative, as we know, the function ex- 
hibits the corresponding monotonicity. In the bottom row of the table the symbol 
+oo \, 0 denotes a monotonic decrease in the values of the function from +oo to 
0, and 0 7 +00 denotes monotonic increase from 0 to ++oo. 

We observe that f’(x) > —4e7!/? as x > —2—Oand f’(x) > 4e7!/7 asx > 
—2-+ 0, so that the point (—2, 0) must be a cusp in the graph (a bend of the same 
type as in the graph of the function |x|), and not a regular point, as depicted in 
Fig. 5.23a). Next, f’(x) — 0 as x > +0, so that the graph should emanate from the 
origin tangentially to the x-axis (remember the geometric meaning of f’(x)!). 

We now make the asymptotics of the function as x — —oo and x > +00 more 
precise. 

Since e~!/* = 1—x7-!+0(x7!) as x > 00, it follows that 


—x—l+o(1) asx>-w, 


|x +2|e1/* ~ 
x+1+o0(1) as x > +00, 
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Table 5.3 

Interval ]J-o, —2[ ]—2, O[ 10, 2/3, ]2/3, +oo[ 
Sign of f(x) - + ae _ 
Convexity of f(x) Upward Downward Downward Upward 
so that in fact the oblique asymptotes of the graph are y = —x — | as x — —oo and 


y=x+lasx>+4+o. 

From these data we can already construct a quite reliable sketch of the graph, but 
we shall go further and find the regions of convexity of the graph by computing the 
second derivative: 


2—3x ,—1/) 
= re |x 


f(x) = x 


23s ew! /x if —2<xandx 40. 


if x < —2, 


Since f” (x) = 0 only at x = 2/3, we have Table 5.3. 

Since the function is differentiable at x = 2/3 and f” (x) changes sign as x passes 
through that point, the point (2/3, f(2/3)) is a point of inflection of the graph. 

Incidentally, if the derivative f’(x) had had a zero, it would have been possible 
to judge using the table of values of f’(x) whether the corresponding point was an 
extremum. In this case, however, f’(x) has no zeros, even though the function has 


a local minimum at x = —2. It is continuous at that point and f’(x) changes from 
negative to positive as x passes through that point. Still, the fact that the function 
has a minimum at x = —2 can be seen just from the description of the variation of 


values of f(x) on the corresponding intervals, taking into account, of course, the 
relation f(—2) =0. 
We can now draw a more precise sketch of the graph of this function (Fig. 5.23b). 
We conclude with one more example. 


Example 28 Let (x, y) be Cartesian coordinates in the plane and suppose a moving 
point has coordinates 


ot _t-28 
~ Y= 72’ a ee: 


at time f (t > 0). Describe the trajectory of the point. 

We begin by sketching the graphs of each of the two coordinate functions x = 
x(t) and y = y(t) (Figs. 5.24a and 5.24b). 

The second of these graphs is somewhat more interesting than the first, and so 
we shall describe how to construct it. 

We can see the behavior of the function y = y(t) ast > +0,f > 1-—0,t— 
1+ 0, and the asymptote y(t) = 2t + o(1) as t — +00 immediately from the form 
of the analytic expression for y(f). 

After computing the derivative 


1 —5¢2 +214 


yQ)= ae” 
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Fig. 5.24 

Table 5.4 

Interval 10, til Iti, I ll, ol Jt, +o0[ 
Sign of y(t) + a = + 

Behavior of y(t) 0A y(t) y(th) \—-0 +00 \ y(t2) y(t2) 7 +00 


we find its zeros: t; © 0.5 and f2 © 1.5 in the region t > 0. 

Then, by compiling Table 5.4 we find the regions of monotonicity and the local 
extreme values y(t,) © 5 (a maximum) and y(f2) © 4 (a minimum). 

Now, by studying both graphs x = x(t) and y = y(t) simultaneously, we make a 
sketch of the trajectory of the point in the plane (Fig. 5.24c). 

This sketch can be made more precise. For example, one can determine the 
asymptotics of the trajectory. 

Since lim;-,1 wo = —1 and lim;_.1 (y(t) + x()) = 2, the line y = —x + 2 is an 
asymptote for both ends of the trajectory, corresponding to t approaching 1. It is 


5.4 Differential Calculus Used to Study Functions 259 


also clear that the line x = 0 is a vertical asymptote for the portion of the trajectory 
corresponding to t > +00. 


We find next 
, 1-50? 4214 
yx = x 1+? 
2 : 
As one can easily see, the function ioe decreases monotonically from 1 to —1 


as u increases from 0 to | and increases from —1 to +00 as u increases from 1 to 
+00. 

From the monotonic nature of y/., one can draw conclusions about the convexity 
of the trajectory on the corresponding regions. Taking account of what has just been 
said, one can construct the following, more precise sketch of the trajectory of the 
point (Fig. 5.24d). 

If we had considered the trajectory for t < 0 as well, the fact that x(t) and y(t) 
are odd functions would have added to the curves already drawn in the x y-plane the 
curves obtained from them by reflection in the origin. 


We now summarize some of these results as very general recommendations for 
the order in which to proceed when constructing the graph of a function given ana- 
lytically. Here they are: 


1° Give the domain of definition of the function. 

2° Note the specific properties of the function if they are obvious (for example, 
evenness or oddness, periodicity, identity to some well-known functions up to sim- 
ple coordinate changes). 

3° Determine the asymptotic behavior of the function under approach to boundary 
points of the domain of definition and, in particular, find asymptotes if they exist. 
4° Find the intervals of monotonicity of the function and exhibit its local extreme 
values. 

5° Determine the convexity properties of the graph and indicate the points of in- 
flection. 

6° Note any characteristic points of the graph, in particular points of intersection 
with the coordinate axes, provided there are such and they are amenable to compu- 
tation. 


5.4.6 Problems and Exercises 


1. Letx = (41,...,%,) anda = (q),...,Q@,), where x; > 0,a; > Ofori=1,...,n 
and )~_, a; = 1. For any number t 4 0 we consider the mean of order t of the 
numbers X\,..., Xp, with weights o;: 


i 1/t 
M, (x, a) = pa 


i=l 
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In particular, when a} =--- =a), = i we obtain the harmonic, arithmetic, and 
quadratic means for t = —1, 1, 2 respectively. 
Show that 


a) limy+0 M; (x, @) = xy --- xn", that is, in the limit one can obtain the geomet- 
ric mean; 

b) lim; +00 Mi (x, @) = maxi <j<n Xi3 

c) lim; —oo Mp (x, &) = minj <j<n Xi3 

d) M;(x,q@) is a nondecreasing function of t on R and is strictly increasing if 
n > | and the numbers x; are all nonzero. 


2. Show that |1 + x|? > 1+ px +cp@p(x), where cp is a constant depending only 
on p, 

pps TONS: testy 

x)= if1< ; 

me |? - for beli1, ie 

and g(x) = |x|? on R if 2 < p. 
3. Verify that cosx < (sinx)3 for 0 < |x| < 5. 
4. Study the function f(x) and construct its graph if 


a) f(x) =arctanlog, cos(zx + as 
b) f@= arccos(5 —sinx); 
c) fx) = Vx + 3); 
d) Construct the curve defined in polar coordinates by the equation g = wa 
p = 0, and exhibit its asymptotics; 

e) Show how, knowing the graph of the function y = f(x), one can obtain the 
graph of the following functions f(x) + B, Af(x), f(x +b), f(ax), and, in par- 
ticular — f(x) and f(—~). 


5. Show that if f ¢ C(a, b[—) and the inequality 


‘(2 ==) 2 f(r) + f 2) 
2 2 


holds for any points x), x2 € Ja, b[, then the function f is convex on Ja, b[. 
6. Show that 


a) if a convex function f : R — R is bounded, it is constant; 
b) if 


Xx—>-0OO X x—>+00 Xx 


0, 


for a convex function f : RR — R, then f is constant. 

c) for any convex function f defined on an open interval a < x < +00 (or 
—0oo < x <a), the ratio f &) tends to a finite limit or to infinity as x tends to infinity 
in the domain of definition of the function. 


7. Show that if f : Ja, b[ > R is a convex function, then 
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a) at any point x € Ja, D[ it has a left-hand derivative f’ and a right-hand deriva- 
tive f , defined as 


f@ +h) — f@) 


La= me 


h 
ben pe FO+ND— FOO) 
a a 


and f! (x) < fi (x); 

b) the inequality f{ (x1) < f! (x2) holds for x1, x2 € Ja, b[ and x1 < x2; 

c) the set of cusps of the graph of f(x) (for which f’ (x) 4 fi (x)) 1s at most 
countable. 


8. The Legendre transform”! of a function f : J > R defined on an interval J C R 
is the function 


f*(t) = sup(tx — f(x). 
xel 
Show that 


a) The set /* of values of t € R for which f*(t) € R (that is, f*(t) 4 oo) is 
either empty or consists of a single point, or is an interval of the line, and in this last 
case the function f*(t) is convex on I*. 

b) If f is a convex function, then /* 4 @, and for f* € C(/*) 


(f*)* = sup(xt — f*(1)) = f(x) 


tel* 


for any x € J. Thus the Legendre transform of a convex function is involutive, (its 
square is the identity transform). 
c) The following inequality holds: 


xt<f(x)+ f*(@t) forx el andtel”. 


d) When f is a convex differentiable function, f*(t) = tx; — f(x;), where x; 
is determined from the equation t = f’(x). Use this relation to obtain a geometric 
interpretation of the Legendre transform f* and its argument f, showing that the 
Legendre transform is a function defined on the set of tangents to the graph of f/f. 

e) The Legendre transform of the function f(x) = axe fora > 1 and x > 0 is 
the function f*(t) = at where t > 0 and x + 3 = |. Taking account of c), use 
this fact to obtain Young’s inequality, which we already know: 


1 1 
ant = — 2" +7", 
a B 


214M. Legendre (1752-1833) — famous French mathematician. 
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f) The Legendre transform of the function f(x) = e* is the function f*(t) = 
tIn£, ¢ > 0, and the inequality 


t 
xt<e*4+tIn- 
e 


holds for x € R andt > 0. 


9. Curvature and the radius and center of curvature of a curve at a point. Suppose a 
point is moving in the plane according to a law given by a pair of twice-differentiable 
coordinate functions of time: x = x(t), y = y(t). In doing so, it describes a certain 
curve, which is said to be given in the parametric form x = x(t), y = y(t). A special 
case of such a definition is that of the graph of a function y = f(x), where one may 
take x =t, y = f(t). We wish to find a number that characterizes the curvature of 
the curve at a point, as the reciprocal of the radius of a circle serves as an indication 
of the amount of bending of the circle. We shall make use of this comparison. 


a) Find the tangential and normal components a; and a, respectively of the ac- 
celeration a = (X(t), ¥(f)) of the point, that is, write a as the sum a; + a,, where a; 
is collinear with the velocity vector v(t) = (x(t), y(t)), so that a, points along the 
tangent to the trajectory and a, is directed along the normal to the trajectory. 

b) Show that the relation 


— |v) 
lan (1) | 
holds for motion along a circle of radius r. 


c) For motion along any curve, taking account of b), it is natural to call the 
quantity 


_ |v@)| 
|an(t)| 
the radius of curvature of the curve at the point (x(t), y(f)). 
Show that the radius of curvature can be computed from the formula 


r(t) 


“21 +2)3/2 
0 eee 
Ixy —xy| 
d) The reciprocal of the radius of curvature is called the absolute curvature of a 
plane curve at the point (x(t), y(t)). Along with the absolute curvature we consider 
the quantity 


ij — Hy 


(OF GER 


called the curvature. 
Show that the sign of the curvature characterizes the direction of turning of the 
curve relative to its tangent. Determine the physical dimension of the curvature. 
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e) Show that the curvature of the graph of a function y = f(x) at a point 
(x, f(x)) can be computed from the formula 


" 
k(x) = a ER 2 a 
1+ (PP? 

Compare the signs of k(x) and y” (x) with the direction of convexity of the graph. 

f) Choose the constants a, b, and R so that the circle (x — a)? + (y - b)* = R? 
has the highest possible order of contact with the given parametrically defined curve 
x =x(t), y= y(t). It is assumed that x(t) and y(f) are twice differentiable and that 
((t0), 5(t0)) # (0, 0). 

This circle is called the osculating circle of the curve at the point (xo, yo). Its 
center is called the center of curvature of the curve at the point (xo, yo). Verify that 
its radius equals the radius of curvature of the curve at that point, as defined in b). 

g) Under the influence of gravity a particle begins to slide without any prelim- 
inary impetus from the tip of an iceberg of parabolic cross-section. The equation 
of the cross-section is x + y? = 1, where x > 0, y > 0. Compute the trajectory of 
motion of the particle until it reaches the ground. 


5.5 Complex Numbers and the Connections Among 
the Elementary Functions 


5.5.1 Complex Numbers 


Just as the equation x* = 2 has no solutions in the domain Q of rational numbers, 
the equation x* = —1 has no solutions in the domain R of real numbers. And, just 
as we adjoin the symbol V2 as a solution of x* = 2 and connect it with rational 
numbers to get new numbers of the form 7; + J/2r2, where r 1, r2 € Q, we introduce 
the symbol i as a solution of x? = —1 and attach this number, which lies outside the 
real numbers, to real numbers and arithmetic operations in R. 

One remarkable feature of this enlargement of the field R of real numbers, among 
many others, is that in the resulting field C of complex numbers, every algebraic 
equation with real or complex coefficients now has a solution. 

Let us now carry out this program. 


a. Algebraic Extension of the Field R 


Thus, following Euler, we introduce a number i, the imaginary unit, such that 
i2 = —1. The interaction between i and the real numbers is to consist of the fol- 
lowing. One may multiply i by numbers y € R, that is, numbers of the form iy 
necessarily arise, and one may add such numbers to real numbers, that is, numbers 


of the form x + iy occur, where x, y ER. 
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If we wish to have the usual operations of a commutative addition and a commu- 
tative multiplication that is distributive with respect to addition defined on the set 
of objects of the form x + iy (which, following Gauss, we shall call the complex 
numbers), then we must make the following definitions: 


(x1 +iyy) + (x2 + iy2) = (1 + x2) +101 + y2) (5.99) 


and 


(x1 + iy1) + (x2 +iy2) = (x1x2 — yi y2) Fi y2 + X2y1). (5.100) 


Two complex numbers x; + iy; and x2 + iy2 are considered equal if and only if 
Xx, =x and yj = yp. 

We identify the real numbers x € R with the numbers of the form x +7 - 0, and 
i with the number 0 + i - 1. The role of 0 in the complex numbers, as can be seen 
from Eq. (5.99), is played by the number 0 +i -0 =0€ R; the role of 1, as can be 
seen from Eq. (5.100), is played by 1+i-0=1€ER. 

It follows from properties of the real numbers and definitions (5.99) and (5.100) 
that the set of complex numbers is a field containing R as a subfield. 

We shall denote the field of complex numbers by C and typical elements of it 
usually by z and w. 

The only nonobvious point in the verification that C is a field is the assertion 
that every non-zero complex number z = x + iy has an inverse z~! with respect to 
multiplication (a reciprocal), that is z- z~! = 1. Let us verify this. 

We call the number x — iy the conjugate of z= x + iy, and we denote it Z. 

We observe that z-Z = (x* + y?)+i-0=x*+ y? £0 if z 40. Thus z7! should 
be taken as ‘Z= 


1 x i y 
ty? e4y2 ety?" 


b. Geometric Interpretation of the Field C 


We remark that once the algebraic operations (5.99) and (5.100) on complex num- 
bers have been introduced, the symbol 7, which led us to these definitions, is no 
longer needed. We can identify the complex number z = x + iy with the ordered 
pair (x, y) of real numbers, called respectively the real part and the imaginary part 
of the complex number z. (The notation for this is x = Rez, y = Imz.) 

But then, regarding the pair (x, y) as the Cartesian coordinates of a point of the 
plane R? = R x R, one can identify complex numbers with the points of this plane 
or with two-dimensional vectors having coordinates (x, y). 

In such a vector interpretation the coordinatewise addition (5.99) of complex 
numbers corresponds to vector addition. Moreover such an interpretation naturally 
leads to the idea of the absolute value or modulus |z| of a complex number as the 
absolute value or length of the vector (x, y) corresponding to it, that is 


lzj=VJx2+y2, ifz=xtiy, (5.101) 
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and also to a way of measuring the distance between complex numbers z; and zz as 
the distance between the points of the plane corresponding to them, that is, as 


Iz1 — z2|= Jo — x2)? + (1 — y2)?. (5.102) 


The set of complex numbers, interpreted as the set of points of the plane, is called 
the complex plane and also denoted by C, just as the set of real numbers and the real 
line are both denoted by R. 

Since a point of the plane can also be defined in polar coordinates (r, g) con- 
nected with Cartesian coordinates by the relations 


x =rcosg, 
: (5.103) 
y=rsing, 
the complex number 
zZ=x+iy (5.104) 
can be represented in the form 
z=r(cosg +ising). (5.105) 


The expressions (5.104) and (5.105) are called respectively the algebraic and 
trigonometric (polar) forms of the complex number. 

In the expression (5.105) the number r > 0 is called the modulus or absolute 
value of the complex number z (since, as one can see from (5.103), r = |z|), and @ 
the argument of z. The argument has meaning only for z 4 0. Since the functions 
cos g and sing are periodic, the argument of a complex number is determined only 
up to a multiple of 277, and the symbol Arg z denotes the set of angles of the form 
gy + 20k, k € Z, where ¢ is any angle satisfying (5.105). When it is desirable for 
every complex number to determine uniquely some angle g € Arg z, one must agree 
in advance on the range from which the argument is to be chosen. This range is 
usually either 0 < g < 27 or —m < g <a. If such a choice has been made, we say 
that a branch (or the principal branch) of the argument has been chosen. The values 
of the argument within the chosen range are usually denoted arg z. 

The trigonometric form (5.105) for writing complex numbers is convenient in 
carrying out the operation of multiplication of complex numbers. In fact, if 


Z1 =ri(cosg¢) +7 sing), 
Z2 =r2(cosg2 +i sing), 
then 
Z1° 22 = (ri cosg) + ir) sing) )(r2 COs G2 + ir? SiN g2) = 
= (r1r2 COS GY] COS Y2 — r1r2 SING, SiNg2)+ 


+ i(rir2 sin g] Cos Y2 + 1r1r2 COS G2 SiN 2), 
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or 


Z1 +22 =rir2(cos(y2 + go) +i sin(gi + ¢2)). (5.106) 


Thus, when two complex numbers are multiplied, their moduli are multiplied and 
their arguments are added. 

We remark that what we have actually shown is that if g; € Argz, and @2 € 
Arg z2, then 9; + ¢2 € Arg(zj - z2). But since the argument is defined only up to a 
multiple of 277, we can write that 


Arg(Z1 - 22) = Argz1 + Argz2, (5.107) 


interpreting this equality as set equality, the set on the right-hand side being the set 
of all numbers of the form g; + ¢2, where yg; € Arg z, and g2 € Arg z2. Thus it is 
useful to interpret the sum of the arguments in the sense of the set equality (5.107). 

With this understanding of equality of arguments, one can assert, for example, 
that two complex numbers are equal if and only if their moduli and arguments are 
equal. 

The following formula of de Moivre?” follows by induction from formula 
(5.106): 


ifz=r(cosg+ising), then z” =r”"(cosng+isinng). (5.108) 


Taking account of the explanations given in connection with the argument of a 
complex number, one can use de Moivre’s formula to write out explicitly all the 
complex solutions of the equation z” =a. 

Indeed, if 


a=p(cosy+isinw) 
and, by formula (5.108) 
z" =r" (cosng +isinng), 


we have r = 2/p and ng = wy + 27k, k € Z, from which we have gy = 7 + an k, 
Different complex numbers are obviously obtained only for k = 0,1,...,n — 1. 
Thus we find n distinct roots of a: 


za = YB(cos( % + x) +isin(* +e) (k=0,1,...,n—1). 
n n 


n n 
In particular, if a = 1, that is, o = 1 and w = 0, we have 
2. 2 
“=e i= cos( x) 4i sin( x) (k=0,1,....2—1). 
n n 


These points are located on the unit circle at the vertices of a regular n-gon. 


22.4. de Moivre (1667-1754) — British mathematician. 


5.5 Complex Numbers and Elementary Functions 267 


In connection with the geometric interpretation of the complex numbers them- 
selves, it is useful to recall the geometric interpretation of the arithmetic operations 
on them. 

For a fixed b € C, the sum z +b can be interpreted as the mapping of C into itself 
given by the formula z+» z +b. This mapping is a translation of the plane by the 
vector b. 

For a fixed a = |a|(cosg + ising) 4 0, the product az can be interpreted as the 
mapping z — az of C into itself, which is the composition of a dilation by a factor 
of |a| and a rotation through the angle g € Arga. This is clear from formula (5.106). 


5.5.2 Convergence in C and Series with Complex Terms 


The distance (5.102) between complex numbers enables us to define the e- 
neighborhood of a number zo € C as the set {z € C | |z — zo| < e}. This set is 
a disk (without the boundary circle) of radius ¢ centered at the point (xo, yo) if 
20 = xj +iyo. 

We shall say that a sequence {z,} of complex numbers converges to Zo € C if 
limp oo |Zn — Z0| = 0. 

It is clear from the inequalities 


max {|xp — xol, Yn — yol} < |Zn — Zol < |xn — Xo| + lyn — vol (5.109) 


that a sequence of complex numbers converges if and only if the sequences of real 
and imaginary parts of the terms of the sequence both converge. 

By analogy with sequences of real numbers, a sequence of complex numbers {Z,} 
is called a fundamental or Cauchy sequence if for every ¢ > 0 there exists an index 
N EN such that |z, — z| <«¢ foralln,m>WN. 

It is clear from inequalities (5.109) that a sequence of complex numbers is a 
Cauchy sequence if and only if the sequences of real and imaginary parts of its 
terms are both Cauchy sequences. 

Taking the Cauchy convergence criterion for sequences of real numbers into ac- 
count, we conclude on the basis of (5.109) that the following proposition holds. 


Proposition 1 (The Cauchy criterion) A sequence of complex numbers converges if 
and only if it is a Cauchy sequence. 


If we interpret the sum of a series of complex numbers 


ztzote tanto: (5.110) 


as the limit of its partial sums s, = zj +--- + Z, aS nm — OO, we also obtain the 
Cauchy criterion for convergence of the series (5.110). 
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Proposition 2 The series (5.110) converges if and only if for every € > O there exists 
N EN such that 


IZm +--+ +2Zn| <€ (5.111) 

for any natural numbersn>m> N. 
From this one can see that a necessary condition for convergence of the series 
(5.110) is that z, — 0 as n —> oo. (This, however, is also clear from the very defini- 


tion of convergence.) 
As in the real case, the series (5.110) is absolutely convergent if the series 


Izu] +]z2])+---+ [zal +--: (5.112) 


converges. 
It follows from the Cauchy criterion and the inequality 


l2m t-++ + Zn] <|Zm| 4+ -°- + [Zn 


that if the series (5.110) converges absolutely, then it converges. 


Examples The series 

I l+ qct qe te thetten, 
2) z— 423+ G25—---, and 

3) l- yet acta 


converge absolutely for all z € C, since the series 

VW) b+ qlel+ xl? +e, 

2!) lal + gle + gle +--, 

3) 1+ plz? + glzl*t+---, 

all converge for any value of |z| € R. We remark that we have used the equality 
|z”| = |z|” here. 


Example 4 The series 1 + z+ z* +--+ converges absolutely for |z| < 1 and its sum 
iss = — For |z| = 1 it does not converge, since in that case the general term does 
not tend to zero. 


Series of the form 
cot e1(z — 20) +++ + en (Z — 20)" + °° (5.113) 


are called power series. 
By applying the Cauchy criterion (Sect. 3.1.4) to the series 


lcol + |e1(z — zo)| +--+ + len(z —z0)"|+°-°, (5.114) 
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we conclude that this series converges if 
—— -1 
— zo| < (lim Vc 
|Z — Zol (lim, | a) > 


and that the general term does not tend to zero if |z — zo| > (impo */Jen|)7!. 
From this we obtain the following proposition. 


Proposition 3 (The Cauchy—Hadamard”* formula) The power series (5.113) con- 
verges inside the disk |z— zo| < R with center at zo and radius given by the Cauchy— 
Hadamard formula 


1 
R = 
limn—+oo */|en| 


At any point exterior to this disk the power series diverges. 
At any point interior to the disk, the power series converges absolutely. 


(5.115) 


Remark In regard to convergence on the boundary circle |z — zo| = R Proposition 3 
is silent, since all the logically admissible possibilities really can occur. 


Examples The series 


Bee 
ie: 


and 


co 1 
7) oni we 


converge in the unit disk |z| < 1, but the series 5) diverges at every point z where 
|z| = 1. The series 6) diverges for z = 1 and (as one can show) converges for z = —1. 
The series 7) converges absolutely for |z| = 1, since \-52"| = +. 


One must keep in mind the possible degenerate case when R = 0 in (5.115), 
which was not taken account of in Proposition 3. In this case, of course, the entire 
disk of convergence degenerates to the single point zg of convergence of the series 
G13). 

The following result is an obvious corollary of Proposition 3. 


Corollary (Abel’s first theorem on power series) [f the power series (5.113) con- 
verges at some value z*, then it converges, and indeed even absolutely, for any value 
of z satisfying the inequality |z — zo| < |z* — Zol. 


The propositions obtained up to this point can be regarded as simple extensions 
of facts already known to us. We shall now prove two general propositions about 


23J, Hadamard (1865-1963) — well-known French mathematician. 
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series that we have not proved up to now in any form, although we have partly 
discussed some of the questions they address. 


Proposition 4 [fa series 7} + z2 +--+ +2Zn +--+ of complex numbers converges 
absolutely, then a series Zn, + Zn. +++: +2%n, +:++: obtained by rearranging” its 
terms also converges absolutely and has the same sum. 


Proof Using the convergence of the series }*>~ , |zn|, given a number ¢ > 0, we 
choose N €N such that }°7° v4.1 |Znl <é. 

We then find an index K € N such that all the terms in the sum Sy = z} +---+zN 
are among the terms of the sum 8% = Zn, +--:+2n, fork > K.Ifs= YG Zn, We 
find that fork > K 


[o,@) [o,@) 
ls— 5] <|s—swltisv—Hl< D> lenlt+ So lenl <2e. 
n=N+1 n=N+1 


Thus we have shown that 5; — s as k — oo. If we apply what has just been 
proved to the series |z1| + |Z2| +--+ [Zn] +--+ and |Z] + [Zng| +++ + [Znqjl + 

, we find that the latter series converges. Thus Proposition 4 is now completely 
proved. 

Our next proposition will involve the product of two series 


(aj Hag t---ta,t::-)-(b) tho t--- +b, +--). 


The problem is that if we remove the parentheses and form all possible pair-wise 
products a;b;, there is no natural order for summing these products, since we have 
two indices of summation. The set of pairs (i, 7), where i, 7 € N, is countable, as 
we know. Therefore we could write down a series having the products a;b; as terms 
in some order. The sum of such a series might depend on the order in which these 
terms are taken. But, as we have just seen, in absolutely convergent series the sum 
is independent of any rearrangement of the terms. Thus, it is desirable to determine 
when the series with terms a;b; converges absolutely. 


Proposition 5 The product of absolutely convergent series is an absolutely conver- 
gent series whose sum equals the product of the sums of the factor series. 


Proof We begin by remarking that whatever finite sum }° a;b; of terms of the form 
ajb; we take, we can always find N such that the product of the sums Ay = a; + 
-+ ay and By = bi +---+ by contains all the terms in that sum. Therefore 


iad) < Ee ee ae Smis Dba Syl 


i,j=l i=l i=l i! 


24The term with index k in this series is the term Zn, With index n, in the original series. Here the 
mapping N35 kt nx € N is assumed to be a bijective mapping on the set N. 
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from which it follows that the series ee jz , 4b; converges absolutely and that its 
sum is uniquely determined independently of the order of the factors. In that case the 
sum can be obtained, for example, as the limit of the products of the sums A, = a, + 

-- +a, and B, = by +--+ +by. But A,B, > AB asn— oo, where A= 7° | ay 
and B = }°°° | by, which completes the proof of Proposition 5. 


The following example is very important. 


Example 8 The series )°° 5 J a” and ro =a 1b” converge absolutely. In the 
product of these series let us group together all monomials of the form a”b” having 
the same total degree n + m =k. We then obtain the series 


But 


k 
1 npm 1 k} npk—n 1 k 
2 a0 ae Serer = ere 


'yy1 
nim. 
m+n=k 


and therefore we find that 


1 1 1 
n m k 
ya" Yo = at oy (5.116) 
n=0 m=0 k=0 


5.5.3 Euler’s Formula and the Connections Among 
the Elementary Functions 


In Examples 1)-3) we established the absolute convergence in C of the series ob- 
tained by extending into the complex domain the Taylor series of the functions e’, 
sinx, and cosx, which are defined on R. For that reason, the following definitions 
are natural ones to make for the functions e*, cos z, and sinz in C: 


1 1 1 
e* =expz: Peete tae +: (5.117) 
= 1 ly ul 
COS Z1= ee aT see, (5.118) 
. 1 1 
sinzi=Z— a Hag = (5.119) 


3! 5! 
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Following Euler,”> let us make the substitution z = iy in Eq. (5.117). By suitably 
grouping the terms of the partial sums of the resulting series, we find that 


1+ 71 Ey) + yO) + 3 Gy) +5) + 5 (ir) ra 


ee a FH. Ae, Te 
=(1-5p Tar --) +i(y ag 5 


that is, 


e'” =cosy +isiny. (5.120) 


This is the famous Euler formula. 

In deriving it we used the fact that i? =—-1, i? = -i, i* =1, i> =i, and so 
forth. The number y in formula (5.120) may be either a real number or an arbitrary 
complex number. 

It follows from the definitions (5.118) and (5.119) that 


cos(—z) = cosz, 
sin(—z) = —sinz, 
that is, cos z is an even function and sin z is an odd function. Thus 
e” =cosy —isiny. 
Comparing this last equality with formula (5.120), we obtain 
cosy = 1 (ei +e"), 
2 


siny = xe —e'’), 
i 


Since y is any complex number, it would be better to rewrite these equalities 
using notation that leaves no doubt of this fact: 


(5.121) 


Thus, if we assume that expz is defined by relation (5.117), then formulas 
(5.121), which are equivalent to the expansions (5.118) and (5.119), like the for- 


25, Euler (1707-1783) — eminent mathematician and specialist in theoretical mechanics, of Swiss 
extraction, who lived the majority of his life in St. Petersburg. In the words of Laplace, “Euler is 
the common teacher of all mathematicians of the second half of the eighteenth century.” 
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mulas 


—_ 


cosh y = 5 (e +e“), 
(5.122) 


sinhz = Gl —e%), 


can be taken as the definitions of the corresponding circular and hyperbolic func- 
tions. Disregarding all the considerations about trigonometric functions that led us 
to this step, which have not been rigorously justified (even though they did lead 
us to Euler’s formula), we can now perform a typical mathematical trick and take 
formulas (5.121) and (5.122) as definitions and obtain from them in a completely 
formal manner all the properties of the circular and trigonometric functions. 

For example, the fundamental identities 


cos” z+ sin? z = 1, 
cosh” Z- sinh” z=l, 
like the parity properties, can be verified immediately. 


The deeper properties, such as, for example, the formula for the cosine and sine 
of a sum follow from the characteristic property of the exponential function: 


exp(z1 + z2) = exp(z1) - exp(z2), (5.123) 


which obviously follows from the definition (5.117) and formula (5.116). Let us 
derive the formulas for the cosine and sine of a sum: 
On the one hand, by Euler’s formula 


ef 1 +22) — cos(z + 22) +i sin(z1 + 22). (5.124) 
On the other hand, by the property of the exponential function and Euler’s formula 


ef @1+22) — eZ e!22 = (cosz; +i sinz)(cosz2 +i sinz2) = 


= (cos Z] COS Z2 — Sinz, Sinzz) + i(sinz, COs Z2 + cos Z2 Sinz). (5.125) 


If z; and z2 were real numbers, then, equating the real and imaginary parts of the 
numbers in formulas (5.124) and (5.125), we would now have obtained the required 
formulas. Since we are trying to prove them for any z1, Z2 € C, we use the fact that 
cos z is even and sinz is odd to obtain yet another equality: 


e!@1+22) = (cos z1 cos z2 — sin z1 sinz2) — i(sinz} cos z2 + cos zj sinz2). (5.126) 


Comparing (5.125) and (5.126), we find 


cos(z1 + z2) = (ei Gite) di e Hatz) = cos z1 Cos 22 — sinz) Sinz, 


1 
2 
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dy Heaied a 
sin(z1 + z2) = — (ei @1 +22) —e pe) = sin z1 Cos z2 + cos z] sinz2. 


2i 

The corresponding formulas for the hyperbolic functions cosh z and sinhz could 

be obtained in a completely analogous manner. Incidentally, as can be seen from 

formulas (5.121) and (5.122), these functions are connected with cos z and sinz by 
the relations 


coshz = cosiz, 
sinh z = —i siniz. 


However, to obtain even such geometrically obvious facts as the equality 
sinz = 0 or cos(z +27) = cos z from the definitions (5.121) and (5.122) is very dif- 
ficult. Hence, while striving for precision, one must not forget the problems where 
these functions naturally arise. For that reason, we shall not attempt at this point 
to overcome the potential difficulties connected with the definitions (5.121) and 
(5.122) when describing the properties of the trigonometric functions. We shall re- 
turn to these functions after presenting the theory of integration. Our purpose at 
present was only to demonstrate the remarkable unity of seemingly completely dif- 
ferent functions, which would have been impossible to detect without going into the 
domain of complex numbers. 

If we take as known that for x € R 


cos(x + 27) =cosx, sin(x + 277) = sinx, 


cos0 = 1, sin0 = 0, 


then from Euler’s formula (5.120) we obtain the relation 


7 4+1=0, (5.127) 


in which all the most important constants of the different areas of mathematics are 
represented: | (arithmetic), 7 (geometry), e (analysis), and i (algebra). 
From (5.123) and (5.127), as well as from (5.120), one can see that 


exp(z + i277) =expz, 


that is, the exponential function is a periodic function on C with the purely imagi- 
nary period T =i2z7. 

Taking account of Euler’s formula, we can now represent the trigonometric no- 
tation (5.105) for a complex number in the form 


i 
z=re’®, 


where r is the modulus of z and ¢ its argument. 
The formula of de Moivre now becomes very simple: 


Za rreln? (5.128) 
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Fig. 5.25 
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5.5.4 Power Series Representation of a Function. Analyticity 


A function w = f(z) of a complex variable z with complex values w, defined on 
a set E CC, is a mapping f : E —> C. The graph of such a function is a subset 
of C x C=R? x R? = R’, and therefore is not visualizable in the traditional way. 
To compensate for this loss to some extent, one usually keeps two copies of the 
complex plane C, indicating points of the domain of definition in one and points of 
the range of values in the other. 

In the examples below the domain E and its image under the corresponding 
mapping are indicated. 


Example 9 See Fig. 5.25. 
Example 10 See Fig. 5.26. 
Example 11 See Fig. 5.27. 


im/2 


These correspondences follow from the equalities i = e'7/*, z= re!?, and iz = 


rei’+/2) that is, a rotation through angle 5 has occurred. 


Example 12 See Fig. 5.28. 
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Fig, 5.28 wi @ a 
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Fig. 5.29 yy 
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Fig. 5.30 i © > ms 
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Fig. 5.31 iy iv 
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For, if z= re’, then z? = r2e!?9. 


U 


2 


Example 13 See Fig. 5.29. 
Example 14 See Fig. 5.30. 


It is clear from Examples 12 and 13 that under this function the unit disk maps 
into itself, but is covered twice. 


Example 15 See Fig. 5.31. 
If z=re!”, then by (5.128), we have 7” = r"e!”® so that in this case the image 


of the disk of radius r is the disk of radius r”, each point of which is the image of n 
points in the original disk (located, as it happens, at the vertices of a regular n-gon). 
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The only exception is the point w = 0, whose pre-image is the point z = 0. How- 
ever, as z — 0, the function z” is an infinitesimal of order n, and so we say that at 
z = 0 the function has a zero of order n. Taking account of this kind of multiplicity, 
one can now say that the number of pre-images of every point w under the map- 
ping z+> z” = w is n. In particular, the equation z” = 0 has the n coincident roots 
ZS: = Zn, = 0. 

In accordance with the general definition of continuity, a function f(z) of 
a complex variable is called continuous at a point zo € C if for any neighbor- 
hood V (f(zo)) of its value f(zo) there exists a neighborhood U(zo) such that 
f(z) € Vf (zo)) for all z € U(zo). In short, 


lim f(z) = f (Zo). 
ZZ 
The derivative of a function f(z) at a point zo, as for the real-valued case, is 


defined as 


fein (5.129) 


220 Z—Z0 
if this limit exists. 


The equality (5.129) is equivalent to 


f(@) — f (20) = f' (zo)(z — Z0) + 0(Z — Z0) (5.130) 


as Z — Zg, corresponding to the definition of differentiability of a function at the 
point Zo. 

Since the definition of differentiability in the complex-valued case is the same 
as the corresponding definition for real-valued functions and the arithmetic proper- 
ties of the fields C and R are the same, one may say that all the general rules for 
differentiation hold also in the complex-valued case. 


Example 16 


(f +8) (2) = f'(@) +8’), 
(fg) (2) = f' @slz) + fe’ (2), 
(go fy’ @=s8'(f@)-f'®, 


so that if f(z) =z’, then f’(z) =1-z+z-1=2z, orif f(z) =z", then f’(z) = 
nz"—!, and if 


Py(z) =co + c1(Z — 20) +-++ + en(Z — 20)", 
then 


P’(z) =c; + 2e2(z— zp) +--+ nen (z — zt, 
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Theorem 1 The sum f(z) = (>. en(z — Zo)" of a power series is an infinitely 
differentiable function inside the entire disk in which convergence occurs. Moreover, 


) — dt 
n 
x O= da (ole <p)"),, FSO; 1j..0; 


and 


Ch = 


+ £(@), 2=0,1,..-. 

n! 

Proof The expressions for the coefficient follows in an obvious way from the ex- 
pressions for f@ (z) fork =n and z = zo. 

As for the formula for f(z), it suffices to verify this formula for k = 1, since 
the function f’(z) will then be the sum of a power series. 

Thus, let us verify that the function g(z) = °°, nen(z — zo)"—! is indeed the 
derivative of f(z). 

We begin by remarking that by the Cauchy—Hadamard formula (5.115) the radius 
of convergence of the derived series is the same as the radius of convergence R of 
the original power series for f(z). 

For simplicity of notation from now on we shall assume that zo = 0, that 
is, f (2) = eg enz”, O(z) = D2 nenz"—| and that these series converge for 
IzZ| << R. 

Since a power series converges absolutely on the interior of its disk of conver- 
gence, we note (and this is crucial) that the estimate |nc,z’~!| = nlen||z|"~! < 
n|cp|r”—! holds for |z| <r < R, and that series yl |r"! converges. Hence, 
for any ¢ > 0 there exists an index N such that 


lo) [oe] € 
>. NCnz || < om ncyr" | < 3 
n=N+1 n=N+1 


for |z| <r. 

Thus at any point of the disk |z| <r the function g(z) is within 3 of the Nth 
partial sum of the series that defines it. 

Now let ¢ and z be arbitrary points of this disk. The transformation 


FO-f@O_ QB oraz" 
a eae 


and the estimate |c,(¢"~! + --- + z"7!)| < |c,|nr”~! enable us to conclude, as 
above, that the difference quotient we are interested in is equal within | to the 
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partial sum of the series that defines it, provided |¢| <r and |z| <r. Hence, for 
|¢| <r and |z| <r we have 


fO=I@. 
tC —Z 


N 
a re 


n=1 


ON E 
™~ — 

=o nenz" : +$+2=. 

Zz 3 


n=1 


p(z)| < 


If we now fix z and let ¢ tend to z, passing to the limit in the finite sum, we see 
that the right-hand side of this last inequality will be less than ¢ for ¢ sufficiently 
close to z, and hence the left-hand side will be also. 

Thus, for any point z in the disk |z| < r < R, we have verified that f’(z) = g(z). 
Since r is arbitrary, this relation holds for any point of the disk |z| < R. 


This theorem enables us to specify the class of functions whose Taylor series 
converge to them. 

A function is analytic at a point zo € C if it can be represented in a neighborhood 
of the point in the following (“analytic”) form: 


F@=> ene — 20)", 


n=0 


that is, as the sum of a power series in z — Zo. 

It is not difficult to verify (see Problem 7 below) that the sum of a power series 
is analytic at any interior point of the disk of convergence of the series. 

Taking account of the definition of analyticity, we deduce the following corollary 
from the definition of analyticity. 


Corollary a) /f a function is analytic at a point, then it is infinitely differentiable at 
that point, and its Taylor series converges to it in a neighborhood of the point. 

b) The Taylor series of a function defined in a neighborhood of a point and in- 
finitely differentiable at that point converges to the function in some neighborhood 
of the point if and only if the function is analytic. 


In the theory of functions of a complex variable one can prove a remarkable fact 
that has no analogue in the theory of functions of a real variable. It turns out that if a 
function f(z) is differentiable in a neighborhood of a point zo € C, then it is analytic 
at that point. This is certainly an amazing fact, since it then follows from the theorem 
just proved that if a function f(z) has one derivative f’(z) in a neighborhood of a 
point, it also has derivatives of all orders in that neighborhood. 

At first sight this result is just as surprising as the fact that by adjoining to R a 
root i of the one particular equation z7 = —1 we obtain a field C in which every 
algebraic polynomial P(z) has a root. We intend to make use of the fact that an 
algebraic equation P(z) = 0 has a solution in C, and for that reason we shall prove it 
as a good illustration of the elementary concepts of complex numbers and functions 
of a complex variable introduced in this section. 
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Fig. 5.32 iy 


5.5.5 Algebraic Closedness of the Field C of Complex Numbers 


If we prove that every polynomial P(z) = co+cyzZ+-+:+¢nz”",n = 1, with complex 
coefficients has a root in C, then there will be no need to enlarge the field C because 
some algebraic equation is not solvable in C. In this sense the assertion that every 
polynomial P(z) has a root establishes that the field C is algebraically closed. 

To obtain a clear idea of the reason why every polynomial has a root in C while 
there can fail to be a root in R, we use the geometric interpretation of complex 
numbers and functions of a complex variable. 

We remark that 


co Cl Cn—-1 
Peyat( 24 —— fe +e), 
Zz va z 


so that P(z) = cyz” + 0(z”) as |z| > oo. Since we are interested in finding a root 
of the equation P(z) = 0, dividing both sides of the equation by c,, we may assume 
that the leading coefficient c, of P(z) equals 1, and hence 


P(z) =z" +0(2") as |z| > 00. (5.131) 


If we recall (Example 15) that the circle of radius r maps to the circle of radius r” 
with center at 0 under the mapping z+ z”, we see that for sufficiently large values 
of r the image of the circle |z| =r under the mapping w = P(z) will be, with small 
relative error, the circle |w| =r” in the w-plane (Fig. 5.32). What is important is 
that, in any case, it will be a curve that encloses the point w = 0. 

If the disk |z| <r is regarded as a film stretched over the circle |z| =, this 
film is mapped into a film stretched over the image of that disk under the mapping 
w = P(z). But, since the latter encloses the point w = 0, some point of that film 
must coincide with w = 0, and hence there is a point Zo in the disk |z| < r that maps 
to w = 0 under the mapping w = P(z), that is, P(zo) = 0. 

This intuitive reasoning leads to a number of important and useful concepts of 
topology (the index of a path with respect to a point, and the degree of a mapping), 
by means of which it can be made into a complete proof that is valid not only 
for polynomials, as one can see. However, these considerations would unfortunately 
distract us from the main subject we are now studying. For that reason, we shall give 
another proof that is more in the mainstream of the ideas we have already mastered. 
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Theorem 2 Every polynomial 

P(z) =cg toyz +-++ + n2" 
of degree n => | with complex coefficients has a root in C. 


Proof Without loss of generality, we may obviously assume that c, = 1. 
Let uw = inf,<c |P(z)|. Since P(z) =z”(1+ wal 5 ie ae we have 


Ch— Cc 
| P(z)| > at"( | n 1 ae | a). 


[Z| |z|" 


and obviously |P(z)| > max{1, 2} for |z| > R if R is sufficiently large. Conse- 
quently, the points of a sequence {z,} at which 0 < |P(zx)| — u < 7 lie inside the 
disk |z| < R. 

We shall verify that there is a point zg in C (in fact, in this disk) at which 
|P(zo)| = u. To do this, we remark that if z, = xz + iy,z, then max{|xx|, |yg|} < 
|zk| < R and hence the sequences of real numbers {x;,} and {yx} are bounded. 
Choosing first a convergent subsequence {x;,} from {x,} and then a convergent 
subsequence {yz,,} from {yx}, we obtain a subsequence zx, = Xk, + 1¥k,, Of 
the sequence {z,} that has a limit limy,— 49 Z%,, = liMm—o0 Xk, + EWM in 00 Vk = 
Xo + iyo = Zo, and since |zx,,| — |zo| as m — oo, it follows that |zo| < R. So as to 
avoid cumbersome notation, and not have to pass to subsequences, we shall assume 
that the sequence {zx} itself converges. It follows from the continuity of P(z) at 
zo € C that limg_+oo P(zx) = P(zo). But then”® | P(zo)| = limg_+o0 | P (ze) | = L- 

We shall now assume that 2 > 0, and use this assumption to derive a contra- 
diction. If P(zo) 4 0, consider the polynomial Q(z) = Fler eo) - By construction 


P(z0) 
_ — |P&+z0)| 
Q(0) = 1 and |Q(Z)| = pq, 21. 


Since Q(0) = 1, the polynomial Q(z) has the form 


Q(z) = 1+ gaz + qepiz’t! +++ +Gn2", 
where |gx| AO and 1 <k <n. If qx = pel”, then for g = —_ we Shall have qx - 
(e!?)k = pel%el™—W) = pe!™ = —p = —|qx|. Then, for z= re'” we obtain 
|O(re'?)| < [E+ geet] + (lgerre! | +--+ Janz") = 
= |b r"lgel| +r (gest +2 + lanl 1) = 


=1—r*(\gel —rlqezil—--+— 1" lanl) <1, 


6Observe that on the one hand we have shown that from every sequence of complex numbers 
whose moduli are bounded one can extract a convergent subsequence, while on the other hand we 
have given another possible proof of the theorem that a continuous function on a closed interval 
has a minimum, as was done here for the disk |z| < R. 
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if r is sufficiently close to 0. But |Q(z)| => 1 for z € C. This contradiction shows 
that P(zo) =0. 


Remark 1 The first proof of the theorem that every algebraic equation with complex 
coefficients has a solution in C (which is traditionally known as the fundamental 
theorem of algebra) was given by Gauss, who in general breathed real life into the 
so-called “imaginary” numbers by finding a variety of profound applications for 
them. 


Remark 2 A polynomial with real coefficients P(z) = ag +- +--+ a,z", as we know, 
does not always have real roots. However, compared with an arbitrary polynomial 
having complex coefficients, it does have the unusual property that if P(zo) = 0, 
then P(zo9) = 0 also. Indeed, it follows from the definition of the complex conjugate 
and the rules for adding complex numbers that (z; + zo) = Z1 + Z2. It follows from 
the trigonometric form of writing a complex number and the rules for multiplying 
complex numbers that 


(71° Z2)= (rie!?! # rei?) = ryr2e! G1+92) = 


—i(gi +92) _ rie?! rye 12 


=ryjr2e =7Z|-Z2. 


Thus, 


P(z0) =a0 +--+ +42) = Go +--+ +GnZ§ =a0 +--+ + an% = Po), 
and if P(zg) =0, then P(z9) = P(Z) = 0. 


Corollary 1 Every polynomial P(z) =co+-+:+¢nz" of degree n > | with complex 
coefficients admits a representation in the form 


P(zZ) =¢n(Z — 21) +++ (Z — Zn), (5.132) 


where Z1,...,Zn € C (and the numbers z1,..., Zn are not necessarily all distinct). 
This representation is unique up to the order of the factors. 


Proof From the long division algorithm for dividing one polynomial P(z) by an- 
other polynomial Q(z) of lower degree, we find that P(z) = g(z) Q(z) +r(z), where 
q(z) and r(z) are polynomials, the degree of r(z) being less than the degree m of 
Q(z). Thus if m = 1, then r(z) =r is simply a constant. 

Let z; be a root of the polynomial P(z). Then P(z) = q(z)(z — z1) +r, and 
since P(z,) =r, it follows that r = 0. Hence if z; is a root of P(z), we have the 
representation P(z) = (z — z1)q(z). The degree of the polynomial g(z) isn — 1, and 
we can repeat the reasoning with g(z) ifn — 1 > 1. By induction we find that P(z) = 
c(z — 21) +++ (Z — Zn). Since we must have cz” = cyz”, it follows that c = cy. 


Corollary 2 Every polynomial P(z) =a9+-+--+4nz" with real coefficients can be 
expanded as a product of linear and quadratic polynomials with real coefficients. 
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Proof This follows from Corollary | and Remark 2, by virtue of which for any 
root zx of P(z) the number Zz, is also a root. Then, carrying out the multiplication 
(z — Zx)(z — Zz) in the product (5.132), we obtain the quadratic polynomial a 
(ze + Z)Z+ \ze|? with real coefficients. The number c,, which equals dy, is a real 
number in this case and can be moved inside one of the sets of parentheses without 
changing the degree of that factor. 


By multiplying out all the identical factors in (5.132), we can rewrite that prod- 
uct: 


P(z) = ene — 21)" +++ (z— zp)”. (5.133) 


The number k; is called the multiplicity of the root z;. 
Since P(z) = (z — z,)*i Q(z), where Q(z;) #0, it follows that 


P'(2) =kj(e— zy 1O@ + &—zp)#O'@ = — zu RE), 
where R(z;) =k; Q(z;) #0. We thus arrive at the following conclusion. 


Corollary 3 Every root z; of multiplicity k; > 1 of a polynomial P(z) is a root of 
multiplicity k; — 1 of the derivative P’(z). 


Not yet being in a position to find the roots of the polynomial P(z), we can use 
this last proposition and the representation (5.133) to find a polynomial p(z) = (z — 
Z1)++:(Z — Zp) whose roots are the same as those of P(z) but are of multiplicity 1. 

Indeed, by the Euclidean algorithm, we first find the greatest common divisor 
q(z) of P(z) and P’(z). By Corollary 3, the expansion (5.133), and Theorem 2, 
the polynomial g(z) is equal, apart from a constant factor, to (z — z})M17!---(z — 
Zz Pe Hence by dividing P(z) by g(z) we obtain, apart from a constant factor 
that can be removed by dividing out the coefficient of z?, a polynomial p(z) = 
(Z—Z1)+++ (2 —Zp). 

Now consider the ratio R(x) = a 
the degree of P(x) is larger or equal than that of Q(x), we apply the division algo- 
rithm and represent P(x) as p(x) Q(x) +r(x), where p(x) and r(x) are polynomi- 
als, the degree of r(x) being ae than that of Q(x). Thus we obtain a representation 
of the form R(x) = f ~ is now a proper fraction 
in the sense that the degree of oy is less than that of Q(x). 

The corollary we are about to state involves the representation of a proper fraction 
as a sum of fractions called partial fractions. 


Corollary 4 a) [If Q(z) = (z- ra ees Zp) and ae is a proper fraction, 
P@ 


there exists a unique representation of the fraction 0 in the form 


P(z) 
—— .134 
Fo ¥(y tA. 6.194) 


j=l \k=1 
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b) If P(x) and Q(x) are polynomials with real coefficients and 


Mn 


O(x) = (« — xy)" = xp) (x? + pix tai)” (x? + pax tan)", 


there exists a unique representation of the proper fraction a in the form 
P(x) : ‘i jk iy Dix +; 
j J j 
= : ab : 5.135 
OW) H(> i) o(> ate) 609 


where a jx, bjx, and c jx are real numbers. 


We remark that there is a universal method of finding the expansions (5.134) and 
(5.135) known as the method of undetermined coefficients, although this method is 
not always the shortest way. It consists of putting all the terms on the right-hand side 
of (5.134) or (5.135) over a common denominator, then equating the coefficients of 
the resulting numerator to the corresponding coefficients of P(x). The system of 
linear equations that results always has a unique solution because of Corollary 4. 

Since we shall as a rule be interested in the expansion of a specific fraction, which 
we shall obtain by the method of undetermined coefficients, we require nothing 
more from Corollary 4 than the assurance that it is always possible to do so. For that 
reason, we shall not bother to go through the proof. It is usually couched in algebraic 
language in a course of modern algebra and in analytic language in a course in the 
theory of functions of a complex variable. 

Let us consider a specially chosen example to illustrate what has just been ex- 
plained. 


Example 17 Let 


P(x) = 2x° + 3x° + 6x4 + 6x? + 10x? + 3x +2, 
OG) Sx! 43084 50" 4 x ee ie Se I, 
Find the partial-fraction expansion (5.135) of the fraction ae : 
First of all, the problem is complicated by the fact that we do not know the 
factors of the polynomial Q(x). Let us try to simplify the situation by eliminating 
any multiple roots there may be of Q(x). We find 


O! (x) = Tx® + 18x° + 25x4 + 28x3 + 21x? + 10x +3. 


By a rather fatiguing, but feasible computation using the Euclidean algorithm, 
we find the greatest common divisor 


d(x) =x* +2x3+4+2x74+2x+1 


of Q(x) and Q’(x). We have written the greatest common divisor with leading co- 
efficient 1. 
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Dividing Q(x) by d(x), we obtain the polynomial 
Ao 2 
q(x)=x°+x°4+x4+1, 


which has the same roots as Q(x), but each with multiplicity 1. The root —1 is 
easily guessed. After g(x) is divided by x + 1, we find a quotient of x? + 1. Thus 


q(x) = (x + 1)(x? +1), 


and then by successively dividing d(x) by x* + 1 and x + 1, we find the factorization 
of d(x); 
d(x) = (x + 1)?(x* + 1), 


and then the factorization 


O(x) = («+ 1)3(x? +1)’. 


P(x) 


OG) in the form 


Thus, by Corollary 4b, we are seeking an expansion of the fraction 


P(x) ay aj2 a13 byx+ey  byox+e12 
= + zt 3 + 3 2 2° 
Q(x) x4+1 @w4l) (x + 1) x*+1 (x* + 1) 


Putting the right-hand side over a common denominator and equating the coeffi- 
cients of the resulting numerator to those of P(x), we arrive at a system of seven 
equations in seven unknowns, solving which, we finally obtain 


P@) 1 2g Ba eal oe 
Ox) x+1 (e+)? @H+D3 x? 4+10 241?’ 


5.5.6 Problems and Exercises 


1. Using the geometric interpretation of complex numbers 


a) explain the inequalities |z} + z2| < |zi| + |z2| and |z1| +--- + [Zn] S |zil+ 
+ [Znl3 
b) exhibit the locus of points in the plane C satisfying the relation |z — 1| + 
Iz +1) <3; 
c) describe all the mth roots of unity and find their sum; 
d) explain the action of the transformation of the plane C defined by the formula 
ZH> Zz. 


2. Find the following sums: 


a) l+q+-:-+4"; 
by Lab gob see4eg” +> for [q] < 1; 
Cc) 1l+e%+---+e"?; 
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d) l+rei? +... + r%ei"?; 

e) L+rel? +---+r%el"? +... for |r| < 1; 

f) l+rcosg+---+r"cosng; 

g) l+rcosg+---+r"cosng+--: for |r| <1; 
h) l+rsing+---+r" sinng; 

i) l+rsing+---+r”sinng+.--- for |r| <1. 


3. Find the modulus and argument of the complex number lim,—,.0(1 + 2)" and 
verify that this number is e<. 
4. a) Show that the equation e” = z in w has the solution w = In|z| +7 Arg z. It is 
natural to regard w as the natural logarithm of z. Thus w = Lnz is not a functional 
relation, since Arg z is multi-valued. 

b) Find Ln1 and Lni. 

c) Set z¥ =e%)"2. Find 17 and i’. 

d) Using the representation w = sinz = on (e!< — e~!2), obtain an expression for 
Z =arcsin w. 

e) Are there points in C where | sin z| = 


5. a) Investigate whether the function f(z) = zt — is continuous at all points of 

the plane C. 

b) Expand the function 
of convergence. 

c) Solve parts a) and b) for the function + = rae x, where A € R is a parameter. 

Can you make a conjecture as to how the radius of convergence is determined by 
the relative location of certain points in the plane C? Could this relation have been 
understood on the basis of the real line alone, that is, by expanding the function 
where A € R and x € R? 


oe > in a power series around zg = 0 and find its radius 


coe Gee 
14)2x2? 
6. a) Investigate whether the Cauchy function 


-1/ 22 
ae e , 240, 
0, z=0 
is continuous at z = 0. 
b) Is the restriction f|p of the function f in a) to the real line continuous? 
c) Does the Taylor series of the function f in a) exist at the point zo = 0? 
d) Are there functions analytic at a point zo € C whose Taylor series converge 
only at the point zo? 
e) Invent a power series pape Cy(Z — Zo)” that converges only at the one point 
Z0- 


7. a) Making the formal substitution z— a = (z — zo) + (zo — @) in the power series 
yg An(z — a)" and gathering like terms, obtain a series )°-° y Cn(z — zo)” and 
expressions for its coefficients in terms of A; and (zo — a)* ,k=0,1,.... 

b) Verify that if the original series converges in the disk |z — a| < R and |zg — 
a| =r < R, then the series defining C,, n = 0, 1,..., converge absolutely and the 
series poe ~9 Cn(z — 20)” converges for |z — zo| < R. —r. 
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Fig. 5.33 


c) Show that if f(z) = S729 An(z — a)” in the disk |z — a| < R and |zo —a| < 
R, then in the disk |z — zo| < R — |zo — a| the function f admits the representation 
f @) = Vin=o Cn(Z = 20)". 
8. Verify that 


a) as the point z € C traverses the circle |z| =r > 1 the point w=z+z7! 
traverses an ellipse with center at zero and foci at +2; 

b) when a complex number is squared (more precisely, under the mapping w b> 
w*), such an ellipse maps to an ellipse with a focus at 0, traversed twice; 

c) under squaring of complex numbers, any ellipse with center at zero maps to 
an ellipse with a focus at 0. 


5.6 Some Examples of the Application of Differential Calculus 
in Problems of Natural Science 


In this section we shall study some problems from natural science that are very dif- 
ferent from one another in their statement, but which, as will be seen, have closely 
related mathematical models. That model is none other than a very simple differ- 
ential equation for the function we are interested in. From the study of one such 
example — the two-body problem — we really began the construction of differential 
calculus. The study of the system of equations we obtained for this problem was 
inaccessible at the time. Here we shall consider some problems that can be solved 
completely at our present level of knowledge. In addition to the pleasure of seeing 
mathematical machinery in action in a specific case, from the series of examples 
in this section we shall in particular acquire additional confidence in both the nat- 
uralness with which the exponential function exp x arises and in the usefulness of 
extending it to the complex domain. 


5.6.1 Motion of a Body of Variable Mass 


Consider a rocket moving in a straight line in outer space, far from gravitating bodies 
(Fig. 5.33). 

Let M(t) be the mass of the rocket (including fuel) at time t, V(f) its velocity 
at time f, and w the speed (relative to the rocket) with which fuel flows out of the 
nozzle of the rocket as it burns. 

We wish to establish the connection among these quantities. 
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Under these assumptions, we can regard the rocket with fuel as a closed system 
whose momentum (quantity of motion) remains constant over time. 

At time ¢ the momentum of the system is M(t) V(t). 

At time tf +) the momentum of the rocket with the remaining fuel is M(t + 
h)V(f +h) and the momentum AJ of the mass of fuel ejected over that time 
|AM|=|M(t+h) — M(t)| = —(M(t +h) — M(t)) lies between the bounds 


(V(t) —@)|AM| < AI <(V(t+h)—@)|AM|, 


that is, AJ = (V(t) — w)|AM|+ a(h)|AM, and it follows from the continuity of 
V(t) that a(h) > Oash > 0. 
Equating the momenta of the system at times ¢ and ¢ + h, we have 


M()V(t)=M(t+h)Vt+h)+ (VG) —o)|AM|+a(h)|AMI, 
or, after substituting |AM| = —(M(t +h) — M(#)) and simplifying, 
M(t+h)(V@+h)—-V@)) = 
=—o(M(t +h) — M@)) +a(h)(M@ +h) - MO). 
Dividing this last equation by / and passing to the limit as h — 0, we obtain 
M(t)V'(t) = —@M'(t). (5.136) 


This is the relation we were seeking between the functions we were interested in, 
V(t), M(t), and their derivatives. 

We now must find the relation between the functions V(t) and M(t) themselves, 
using the relation between their derivatives. In general a problem of this type is more 
difficult than the problem of finding the relations between the derivatives knowing 
the relation between the functions. However, in the present case, this problem has a 
completely elementary solution. 

Indeed, after dividing Eq. (5.136) by M(t), we can rewrite it in the form 


V(t) = (—wInM)'(t). (5.137) 
But if the derivatives of two functions are equal on an interval, then the functions 
themselves differ by at most a constant on that interval. 
Thus it follows from (5.137) that 
V(t) =—olnM(t)+c. (5.138) 


If it is known, for example, that V(0) = Vo, this initial condition determines the 
constant c completely. Indeed, from (5.138) we find 


c=Vo+talnM(0), 
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and we then find the formula we were seeking~’ 


M(0) 
M(t)- 


Vi)=Vo+oln (5.139) 

It is useful to remark that if mp is the mass of the body of the rocket and mp is 
the mass of the fuel and V is the terminal velocity achieved by the rocket when all 
the fuel is expended, substituting M(0) = mp +mepg and M(t) = mp, we find 


ME 
V= ¥y+oin(I + =), 


MR 


This last formula shows very clearly that the terminal velocity is affected not so 
much by the ratio mp/mp inside the logarithm as by the outflow speed w, which 
depends on the type of fuel used. It follows in particular from this formula that if 
Vo = 0, then in order to impart a velocity V to a rocket whose own mass is mp one 
must have the following initial supply of fuel: 


mR = mr(eV!/? = 1). 


5.6.2 The Barometric Formula 


This is the name given to the formula that exhibits the dependence of atmospheric 
pressure on elevation above sea level. 

Let p(h) be the pressure at elevation h. Since p(h) is the weight of the column of 
air above an area of 1 cm? at elevation h, it follows that p(h+A) differs from p(h) 
by the weight of the portion of the gas lying in the parallelepiped whose base is the 
original area of 1 cm? and the same area at elevation h + A. Let p(h) be the density 
of air at elevation h. Since p(h) depends continuously on h, one may assume that 
the mass of this portion of air is calculated from the formula 


p(&) g/cm’ - 1 cm*- Acm= p(é)Ag, 


where é is some height between h and h + A. Hence the weight of that mass7® is 


gpa. 
Thus, 


pih+ A) — p(h)=—gp(§)A. 


27This formula is sometimes connected with the name of K.E. Tsiolkovskii (1857-1935), a Russian 
scientist and the founder of the theory of space flight. But it seems to have been first obtained by 
the Russian specialist in theoretical mechanics I.V. Meshcherskii (1859-1935) in an 1897 paper 
devoted to the dynamics of a point of variable mass. 


28 Within the region where the atmosphere is noticeable, g may be regarded as constant. 
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After dividing this equality by A and passing to the limit as A — 0, taking ac- 
count of the relation € — h, we obtain 


p'(h) = —gp(h). (5.140) 


Thus the rate of variation in atmospheric pressure has turned out to be propor- 
tional to the density of the air at the corresponding elevation. 

To obtain an equation for the function p(h), we eliminate the function p(h) from 
(5.140). By Clapeyron’s law”? (the ideal gas law) the pressure p, molar volume V, 
and temperature T of the gas (on the Kelvin*® scale) are connected by the relation 


pV _ 


R, 5.141 
T ( ) 


where R is the so-called universal gas constant. If M is the mass of one mole of air 
and V its volume, then p = y, so that from (5.141) we find 


Setting 7 = £T, we thus have 
p=X(T)p. (5.142) 


If we now assume that the temperature of the layer of air we are describing is con- 
stant, we finally obtain from (5.140) and (5.142) 


& 
p'(h) = —5 ph). (5.143) 
This differential equation can be rewritten as 


pth) g 


pth) 


(In p)'(h) = (-£:) 


or 


from which we derive 
In p(h) = —Sh ee: 
or 


p(h) =e -e B/N 


2°R.PE. Clapeyron (1799-1864) — French physicist who studied thermodynamics. 
30W. Thomson (Lord Kelvin) (1824-1907) — famous British physicist. 
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The factor e“ can be determined from the known initial condition p(0) = po, 
from which it follows that e° = po. 
Thus, we have found the following dependence of pressure on elevation: 


p= poe 8/V". (5.144) 


For the air at sea level (at zero altitude) and at zero temperature (273 K = 0 °C), 
by the above formula A = ce it would be possible to obtain for 4 the value 
7.8 x 10* (m/s)?. We set g = 10 m/s”. The formula (5.144) acquires a completely 
finished form after these numerical values are substituted. In particular, for such val- 
ues A and g, the formula (5.144) shows that the pressure drops by a factor of e (* 3) 

a 


at the height h = z= 7.8 x 10° m=7.8 km. It will increase by the same amount, 


if one descends to a mine at a depth of order 7.8 km.*! 


5.6.3 Radioactive Decay, Chain Reactions, and Nuclear Reactors 


It is known that the nuclei of heavy elements are subject to sporadic (spontaneous) 
decay. This phenomenon is called natural radioactivity. 

The main statistical law of radioactivity (which is consequently valid for amounts 
and concentrations of a substance that are not too small) is that the number of decay 
events over a small interval of time / starting at time f is proportional to / and to the 
number N (ft) of atoms of the substance that have not decayed up to time f, that is, 


N(t+h)—N(t)® -AN(t)h, 


where A > 0 is a numerical coefficient that is characteristic of the chemical element. 
Thus the function N(t) satisfies the now familiar differential equation 


N'(t)=—AN(t), (5.145) 
from which it follows that 
N(t)= Noe”, 


where No = N (0) is the initial number of atoms of the substance. 

The time 7 required for half of the initial number of atoms to decay is called 
the half-life of the substance. The quantity T can thus be found from the equa- 
tion e47 = 7 that is, T = 2 x 0.89 For example, for polonium-210 (Po?!°) the 
half-life T is approximately 138 days, for radium-226 (Ra?*°), T ~ 1600 years, 
for uranium-235 (U2), T ¥7.1 x 108 years, and for its isotope U233, 7 24.5 x 


10° years. 


31Jt should be noted that this barometric formula corresponds only approximately to the real dis- 
tribution of pressure and density of the air in Earth’s atmosphere. Some corrections and additions 
are given in Problem 2 at the end of this section. 
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A nuclear reaction is an interaction of nuclei or of a nucleus with elementary 
particles resulting in the appearance of a nucleus of a new type. This may be nu- 
clear fusion, in which the coalescence of the nuclei of lighter elements leads to the 
formation of nuclei of a heavier element (for example, two nuclei of heavy hydrogen 
— deuterium — yield a helium nucleus along with a release of energy); or it may be 
the decay of a nucleus and the formation of one or more nuclei of lighter elements. 
In particular, such decay occurs in approximately half of the cases when a neutron 
collides with a U?*> nucleus. The breakup of the uranium nucleus leads to the for- 
mation of 2 or 3 new neutrons, which may then participate in further interactions 
with nuclei, causing them to split and thereby leading to further multiplication of 
the number of neutrons. A nuclear reaction of this type is called a chain reaction. 

We shall describe a theoretical mathematical model of a chain reaction in a ra- 
dioactive element and obtain the law of variation in the number N(t) of neutrons as 
a function of time. 

We take the substance to have the shape of a sphere of radius r. If r is not too 
small, on the one hand new neutrons will be generated over the time interval h 
measured from some time ¢ in a number proportional to h and N(t), while on the 
other hand some of the neutrons will be lost, having moved outside the sphere. 

If v is the velocity of a neutron, then the only ones that can leave the sphere in 
time h are those lying within v/ of its boundary, and of those only the ones whose 
direction of motion is approximately along a radius. Assuming that those neutrons 
constitute a fixed proportion of the ones lying in this zone, and that neutrons are 
distributed approximately uniformly throughout the sphere, one can say that the 
number of neutrons lost over the time interval h is proportional to N(t) and the 
ratio of the volume of this boundary layer to the volume of the sphere. 

What has just been said leads to the equality 


N@+h)—N(t)h¥aN(Hh— Eh (5.146) 


(since the volume of the boundary layer is approximately 4rr*vh, and the volume 
of the sphere is 4nr°). Here the coefficients w and 6 depend only on the particular 
radioactive substance. 

After dividing by h and passing to the limit in (5.146) as h — 0, we obtain 


N'(t)= (« = e) wo. (5.147) 


N(t)= Noern| (« _ ae 


It can be seen from this formula that when (a — a) > 0, the number of neutrons 
will increase exponentially with time. The nature of this increase, independently of 
the initial condition No, is such that practically total decay of the substance occurs 


from which 
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over a very short time interval, releasing a colossal amount of energy — that is an 
explosion. 

If (a — B) < 0, the reaction ceases very quickly since more neutrons are being 
lost than are being generated. 

If the boundary condition between the two conditions just considered holds, that 
is, @ — g = 0, an equilibrium occurs between the generation of neutrons and their 
exit from the reaction, as a result of which the number of neutrons remains approx- 
imately constant. 

The value of r at which a — B = 0 is called the critical radius, and the mass of 
the substance in a sphere of that volume is called the critical mass of the substance. 

For U??> the critical radius is approximately 8.5 cm, and the critical mass ap- 
proximately 50 kg. 

In nuclear reactors, where steam is produced by a chain reaction in a radioactive 
substance there is an artificial source of neutrons, providing the fissionable mat- 
ter with a certain number 7 of neutrons per unit time. Thus for an atomic reactor 
Eq. (5.147) is slightly altered: 


N'(th= («- Pon. (5.148) 


This equation can be solved by the same device as Eq. (5.147), since 


aoe‘ is the derivative of the function =, TF In[((a — ByN (t) + n] if 


a — é # 0. Consequently the solution of Eq. (5.148) has the form 


oe Noe’@— 8/1! = apr ll _ e(@-B/nt} ifa— B #0, 
No +nt if — 4 =0. 


It can be seen from this solution that if a — B > 0 (supercritical mass), an explo- 


sion occurs. If the mass is pre-critical, however, that is, a — B < 0, we shall very 
soon have 


NWQ)e — 
Oe 
= 

Thus, if the mass of radioactive substance is maintained in a pre-critical state but 
close to critical, then independently of the power of the additional neutron source, 
that is, independently of n, one can obtain higher values of N(t) and consequently 
greater power from the reactor. Keeping the process in the pre-critical zone is a 


delicate matter and is achieved by a rather complicated automatic control system. 


5.6.4 Falling Bodies in the Atmosphere 


We are now interested in the velocity v(t) of a body falling to Earth under the 
influence of gravity. 
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If there were no air resistance, the relation 
v(t) =g, (5.149) 


would hold for fall from relatively low altitudes. This law follows from Newton’s 
second law ma = F and the law of universal gravitation, by virtue of which for 
h < R (where R is the radius of the Earth) 


Mm Mm 


PO= CRT hoy OR 


= gm. 


A body moving in the atmosphere experiences a resistance that depends on the 
velocity of the motion, and as a result, the velocity of free fall for a heavy body does 
not grow indefinitely, but stabilizes at a certain level. For instance, during skydiving 
or parachuting, if the parachute of a skydiver does not open for a long time, the speed 
of a midsized skydiver in the lower strata of the atmosphere stabilizes at 50-60 m/s. 

We shall assume that under these conditions (that is, for this body and a speed 
range from 0 to 60), the resistance force is proportional to the velocity of the body 
moving in the air. By equating the acting forces on the body, we arrive at the fol- 
lowing equation, which must satisfy the velocity of the free-falling body in the at- 
mosphere: 

mv(t) =mg — av. (5.150) 


Dividing this equation by m and denoting * by £, we finally obtain 
v(t) =—fvt+g. (5.148’) 


We have now arrived at an equation that differs from Eq. (5.148) only in notation. 
We remark that if we set —Bu(t) + g = f(t), then, since f’(t) = —Bv'(t), one can 
obtain from (5.148) the equivalent equation 


f't)=—-Bf), 


which is the same as Eq. (5.143) or (5.145) except for notation. Thus we have once 
again arrived at an equation whose solution is the exponential function 


fO=fOe™. 


It follows from this that the solution of Eq. (5.148’) has the form 


v(t) = 58+ ( - ser 


and the solution of the basic equation (5.150) has the form 


vie es (v = ms) eorm, (5.151) 
a a 


where vg = v(0) is the initial vertical velocity of the body. 
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It can be seen from (5.151) that for aw > 0 a body falling in the atmosphere reaches 
a steady state at which u(t) ~ %g. Thus, in contrast to fall in airless space, the 
velocity of descent in the atmosphere depends not only on the shape of the body, but 
also on its mass. As a — 0, the right-hand side of (5.151) tends to vo + gt, that is, 
to the solution of Eq. (5.149) obtained from (5.150) when a = 0. 

Using formula (5.151), one can get an idea of how quickly the limiting velocity 
of fall in the atmosphere is reached. 

For example, if a parachute is designed to that a person of average size will fall 
with a velocity of the order 10 meters per second when the parachute is open, then, 
if the parachute opens after a free fall during which a velocity of approximately 
50 meters per second has been attained, the person will have a velocity of about 
12 meters per second three seconds after the parachute opens. 

Indeed, from the data just given and relation (5.151) we find *g ~ 10, 7 ~ 1, 
vp = 50 m/s, so that relation (5.151) assumes the form 


v(t) =10+40e~*. 


Since e* ~ 20, for t = 3, we obtain v © 12 m/s. 

Note that at the opening of the parachute, which is significantly bigger on the 
front surface compared with the body of the skydiver, the force of resistance is 
already proportional to the square of the speed. This means that after the opening of 
the parachute, the slowdown will be quicker than that obtained from the calculation 
with formula (5.151), corresponding to Eq. (5.150). 

It is useful to rewrite and solve the equation of type (5.150) with this new assump- 
tion of the quadratic dependence of the resistance force on the speed of movement 
and to make the necessary correction to the result obtained in the first calculation.** 


5.6.5 The Number e and the Function exp x Revisited 


By examples we have verified (see also Problems 3 and 4 at the end of this section) 
that a number of natural phenomena can be described from the mathematical point 
of view by the same differential equation, namely 


fsa, (5.152) 


32Tt should be noted that here, like in many others studied examples of problems of natural science, 
we illustrate the calculation method. In the calculation are included some original data (empirical 
constants, various assumptions) on which depends the level of concordance of the calculations with 
the reality. This data are not given by mathematics but by a corresponding science. For example, 
in order to obtain data on the aerodynamic properties of the bodies, in particular the nature of the 
change in the resistance force occurring during their movement in the atmosphere, it is necessary 
often to perform a long series of experiments in wind tunnels. Mathematics helps to calculate one 
or another mathematical model of the phenomenon, it often provides a ready calculation procedure, 
but the correctness of the initial data and the adequacy of the model is the work of another relevant 
science (biology, chemistry, physics, ...). 
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whose solution f(x) is uniquely determined when the “initial condition” f (0) is 
specified. Then 


f(x) = fOe™. 


We introduced the number e and the function e* = expx in a rather formal way 
earlier, assuring the reader that e really was an important number and exp x really 
was an important function. It is now clear that even if we had not introduced this 
function earlier, it would certainly have been necessary to introduce it as the solution 
of the important, though very simple equation (5.152). More precisely, it would have 
sufficed to introduce the function that is the solution of Eq. (5.152) for some specific 
value of a, for example, a = 1; for the general equation (5.152) can be reduced to 
this case by changing to a new variable t connected with x by the relation x = £ 
(a £0). 


Indeed, we then have 


d dF(t) 
sa=$(2)= Fo, re) = <aF' 


dt 


and instead of the equation f’(x) = af (x) we then have a F’(t) =a F(t), or F’(t) = 
F(t). 
Thus, let us consider the equation 


i M=f (5.153) 


and denote the solution of this equation satisfying f(0) = 1 by exp x. 

Let us check to see whether this definition agrees with our previous definition of 
exp x. 

Let us try to calculate the value of f(x) starting from the relation f(0) = 1 and 
the assumption that f satisfies (5.153). Since f is differentiable, it is continuous. 
But then Eq. (5.153) implies that f’(x) is also continuous. Moreover, it follows 
from (5.153) that f also has a second derivative f” (x) = f’(x), and in general that 
f is infinitely differentiable. Since the rate of variation f’(x) of the function f (x) 
is continuous, the function f’ changes very little over a small interval h of variation 
of its argument. Therefore f (x9 +h) = f (xo) + f’(&)h & f (xo) + f’(xo)h. Let us 
use this approximate formula and traverse the interval from 0 to x in small steps of 
sizeh= where n € N. If x9 = O and xp41 = xx +h, we should have 


Sf (ea) © fxn) + f'C)h. 
Taking account of (5.153) and the condition f(0) = 1, we have 


f(x) = fn) © fOn-1) + fi! On-1)h = 
= f (%n-1)(1 +h) © (f On—2) +f’ @n—2)h) A +h) = 
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Fig. 5.34 


= f(%n-2)(1 +h)? >--- ® fo) +h)" = 
= f(O)1+h)" = (1+*) 


It seems natural (and this can be proved) that the smaller the step h = *, the closer 
the approximation in the formula f(x) + (1+ ae 
Thus we arrive at the conclusion that 


f(x) = lim (1+ *) 
noo n 


In particular, if we denote the quantity f(1) = lim,..(1 + Lyn by e and show 
that e 4 1, we shall have obtained 


n 
f(x) = lim (14 *) =lim( +1)" = lim[(.+9)'/"]* =e", (5.154) 
noo n t>-0 t>0 


since we know that u® > v® ifu > v. 

This method of solving Eq. (5.153) numerically, which enabled us to obtain for- 
mula (5.154), was proposed by Euler long ago, and is called Euler’s polygonal 
method. This name is connected with the fact that the computations carried out 
in it have a geometric interpretation as the replacement of the solution f(x) of the 
equation (or rather its graph) by an approximating graph consisting of a broken line 
whose links on the corresponding closed intervals [x;, xx41] (k =0,...,2 — 1) are 
given by the equations y = f (xx) + f’ (xg) (x — xx) (See Fig. 5.34). 

We have also encountered the definition of the function exp x as the sum of the 
power series )°>~ 4 ax", This definition can also be reached from Eq. (5.153) by 
using the following frequently-applied device, called the method of undetermined 
coefficients. We seek a solution of Eq. (5.153) as the sum of a power series 


fx) =cotcix tes ++ enx te, (5.155) 


whose coefficients are to be determined. 
As we have seen (Theorem 1 of Sect. 5.5) Eq. (5.155) implies that cy 
But, by (5.153), f(0) = f’(0) =---= f™(O) =---, and since f(0) = 1, we have 


_ f£™@) 
_ n!\ 
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1 


Cn = a> 


that is, if the solution has the form (5.155) and f (0) = 1, then necessarily 


1 i) 1 , 
f@)H=lt ext axe $0. + =x" 4+--- 
1! 2! n! 

We could have verified independently that the function defined by this series is 
indeed differentiable (and not only at x = 0) and that it satisfies Eq. (5.153) and the 
initial condition f(0) = 1. However, we shall not linger over this point, since our 
purpose was only to find out whether the introduction of the exponential function 
as the solution of Eq. (5.153) with the initial condition f(0) = | was in agreement 
with what we had previously meant by the function exp x. 

We remark that Eq. (5.153) could have been studied in the complex plane, that 
is, we could have regarded x as an arbitrary complex number. When this is done, 
the reasoning we have carried out remains valid, although some of the geometric 
intuitiveness of Euler’s method may be lost. 

Thus it is natural to expect that the function 


1 1 1 
Z —+ ae a a | 7h eee 
Pa brae tae © seer 


is the unique solution of the equation 


fO=f@ 


satisfying the condition f(0) = 1. 


5.6.6 Oscillations 


If a body suspended from a spring is displaced from its equilibrium position, for 
example by lifting it and then dropping it, it will oscillate about its equilibrium 
position. Let us describe this process in its general form. 

Suppose it is known that a force is acting on a point mass m that is free to move 
along the x-axis, and that the force F = —kx is proportional*? to the displacement of 
the point from the origin. Suppose also that we know the initial position x9 = x (0) of 
the point mass and its initial velocity vo = x (0). Let us find the dependence x = x(t) 
of the position of the point on time. 

By Newton’s law, this problem can be rewritten in the following purely mathe- 
matical form: Solve the equation 


mx (t) = —kx(t) (5.156) 


under the initial conditions x9 = x(0), x(0) = vo. 


33Jn the case of a spring, the coefficient k > 0 characterizing its stiffness is called the modulus. 
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Let us rewrite Eq. (5.156) as 
k 
X(t) + —x(t) =0 (5.157) 
m 
and again try to make use of the exponential. Specifically, let us try to choose the 


number A so that the function x(t) =e” satisfies Eq. (5.157). 
Making the substitution x(t) = e* in (5.157), we obtain 


k 
(2+ le" =0, 
m 


or 
> &k 
A“ +—=0, (5.158) 
m 
that is, A; = —,/ as h2= -£, Since m > 0, we have the two imaginary num- 
bers A, = —i x 42 =1 fC when k > 0. We had not reckoned on this possibility; 


however, let us continue our study. By Euler’s formula 


k i 
at 1 — cos | isin, [E, 

m m 

Jee [k : 
e! k/mt _ cog Ky isin, 

m m 


Since differentiating with respect to the real variable t amounts to differentiating 
the real and imaginary parts of the function e*” separately, Eq. (5.157) must be 


satisfied by both functions cos ,/ ky and sin ,/ xe, And this is indeed the case, as 


one can easily verify directly. Thus the complex exponential function has enabled 
us to guess two solutions of Eq. (5.157), any linear combination of which 


[k [k 
x(t) =c cos ,/—t + c2sin,/ —t, (5.159) 
m m 


is obviously also a solution of Eq. (5.157). 
We choose the coefficients cy and c2 in (5.159) from the condition 
Xp = x(0) =c), 


: k k k k k 
vg = x(0) = | —c1,/ — sin,/ —t+c2,/—cos,/—t =c€2,/—. 
m m m m /|\,<09 m 
Thus the function 
k m . k 
x(t) = x9 cos,/ —t + v9,/ — sin,/ —t (5.160) 
m k m 


is the required solution. 
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By making standard transformations we can rewrite (5.160) in the form 


2 am. k 
x(t) = x9 + 97 sin aoe ; (5.161) 


x0 
2 2m 
V¥%0 + %F 


Thus, for k > 0 the point will make periodic oscillations with period T = 27 Jt ; 


where 


a = arcsin 


that is, with frequency r = py B , and amplitude , eos + up t. We state this be- 
cause it is clear from physical considerations that the solution (5.160) is unique. 
(See Problem 5 at the end of this section.) 

The motion described by (5.161) is called a simple harmonic oscillation, and 
Eq. (5.157) the equation of a simple harmonic oscillator. 

Let us now turn to the case when k < 0 in Eq. (5.158). Then the two func- 


tions e*1! = exp(—,/— 1) and ¢72! = exp( —*p) are real-valued solutions of 


Eq. (5.157) and the function 
x(t) =cye*"! + cre*! (5.162) 


is also a solution. We choose the constants c, and c2 from the conditions 


x0 =x(0) =c1 +2, 
v9 = X(O) =cyAy + C2A2. 

This system of linear equations always has a unique solution, since its determi- 
nant A2 — Aq is not 0. 

Since the numbers A; and A2 are of opposite sign, it can be seen from (5.162) 
that for k < 0 the force F = —kx not only has no tendency to restore the point to 
its equilibrium position at x = 0, but in fact as time goes on, carries it an unlimited 
distance away from this position if x9 or vg is nonzero. That is, in this case x = 0 is 
a point of unstable equilibrium. 

In conclusion let us consider a very natural modification of Eq. (5.156), in which 
the usefulness of the exponential function and Euler’s formula connecting the basic 
elementary functions shows up even more clearly. 

Let us assume that the particle we are considering moves in a medium (the air 
or a liquid) whose resistance cannot be neglected. Suppose the resisting force is 
proportional to the velocity of the point. Then, instead of Eq. (5.156) we must write 


mx(t) = —ax(t) — kx(t), 
which we rewrite as 


: a. k 
X(t) + —x(t) + —x(t) =0. (5.163) 
m m 
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If once again we seek a solution of the form x(t) = e*', we arrive at the quadratic 
equation 


whose roots are 41,2 = —5— 
The case when a? — 4mk > 0 leads to two real roots 41 and Az, and the solution 
can be found in the form (5.162). 
We shall study in more detail the case in which we are more interested, when 
a* — 4mk <0. Then both roots 4; and Az are complex, but not purely imaginary: 


a WV4mk — a? 
M= i ; 

2m 2m 

a V4mk — a? 
A2= +i : 

2m 2m 


In this case Euler s formula yields 
Ait __ a a 
e*!' = exp( ——t } (cost — isin(af), 
2m 
eft! — exp(- <1) (coswt +isinet), 
2m 


where w = al ina Thus we find the two real-valued solutions exp(— im) cos wt 
and exp(— =) sinwt of Eq. (5.163), which would have been very difficult to guess. 
We then seek a solution of the original equation in the form of a linear combination 
of these two 


x) =exp(— Ft Jt cos wt + c2 sinwt), (5.164) 


choosing c; and c2 so that the initial conditions x(0) = x9 and x (0) = vo are satis- 
fied. 

The system of linear equations that results, as one can verify, always has a unique 
solution. Thus, after transformations, we obtain the solution of the problem from 
(5.164) in the form 


x= Aesp(- 31) sin(wt +a), (5.165) 
m 


where A and a are constants determined by the initial conditions. 
It can be seen from this formula that, because of the factor exp(— 57,1), when 
a > 0 and m > 0, the oscillations will be damped and the rate of damping of 
a 1 


the amplitude depends on the ratio —. The frequency of the oscillations s-w = 


a / £ - Gr will not vary over time. The quantity w also depends only on the 
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ratios A and ai which, however, could have been foreseen from the form (5.163) of 
the original equation. When a = 0, we again return to undamped harmonic oscilla- 
tions (5.161) and Eq. (5.157). 


5.6.7 Problems and Exercises 


1. Efficiency in rocket propulsion. 


a) Let Q be the chemical energy of a unit mass of rocket fuel and @ the out- 
flow speed of the fuel. Then 5a" is the kinetic energy of a unit mass of fuel when 


ejected. The coefficient a in the equation 1G? 


z@° = aQ is the efficiency of the pro- 

cesses of burning and outflow of the fuel. For engines of solid fuel (smokeless pow- 

der) w = 2 km/s and Q = 1000 kcal/kg, and for engines of liquid fuel (gasoline 

with oxygen) w = 3 km/s and Q = 2500 kcal/kg. Determine the efficiency a for 
these cases. 

b) The efficiency of a rocket is defined as the ratio of its final kinetic energy 


mr* to the chemical energy of the fuel burned mp Q. Using formula (5.139), obtain 
a formula for the efficiency of a rocket in terms of mp, mp, Q, and a (see part a)). 

c) Evaluate the efficiency of an automobile with a liquid-fuel jet engine, if the 
automobile is accelerated to the usual city speed limit of 60 km/h. 

d) Evaluate the efficiency of a liquid-fuel rocket carrying a satellite into low 
orbit around the earth. 

e) Determine the final speed for which rocket propulsion using liquid fuel is 
maximally efficient. 

f) Which ratio of masses mp/mp yields the highest possible efficiency for any 
kind of fuel? 


2. The barometric formula. 


a) Using the data from Sect. 5.6.2, obtain a formula for a correction term to take 
account of the dependence of pressure on the temperature of the air column, if the 
temperature is subject to variation (for example, seasonal) within the range +40 °C. 

b) Use formula (5.144) to determine the dependence of pressure on elevation at 
temperatures of —40 °C, 0 °C, and 40 °C, and compare these results with the results 
given by your approximate formula from part a). 

c) The change in temperature of the atmosphere at an altitude of 10-11 km is 
well described by the following empirical formula: T (h) = To — ah, where To is the 
temperature at sea level (at h = 0 m), the coefficient a = 6.5 x 1073 K/m, and h is 
the height in meters. Deduce under these conditions the formula for the dependence 
of the pressure on the height. (7p it is often given the value 288 K, corresponding to 
15 °C.) 

d) Find the pressure in a mine shaft at depths of 1 km, 3 km and 9 km using 
formula (5.144) and the formula that you obtained in c). 
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e) Independently of altitude, air consists of approximately 1/5 oxygen. The par- 
tial pressure of oxygen is also approximately 1/5 of the air pressure. A certain 
species of fish can live under a partial pressure of oxygen not less than 0.15 atmo- 
spheres. Should one expect to find this species in a river at sea level? Could it be 
found in a river emptying into Lake Titicaca at an elevation of 3.81 km? 


3. Radioactive decay. 


a) By measuring the amount of a radioactive substance and its decay products in 
ore samples of the Earth, assuming that no decay products were originally present, 
one can estimate the age of the Earth (at least from the time when the substance 
appeared). Suppose that in a rock there are m grams of a radioactive substance and 
r grams of its decay product. Knowing the half-life T of the substance, find the time 
elapsed since the decay began and the amount of radioactive substance in a sample 
of the same volume at the initial time. 

b) Atoms of radium in an ore constitute approximately 10~ |? of the total number 
of atoms. What was the radium content 10°, 10°, and 5 x 10° years ago? (The age 
of the Earth is estimated at 5 x 10° years.) 

c) In the diagnosis of kidney diseases one often measures the ability of the kid- 
neys to remove from the blood various substances deliberately introduced into the 
body, for example creatin (the “clearance test’). An example of an opposite process 
of the same type is the restoration of the concentration of hemoglobin in the blood 
of a donor or of a patient who has suddenly lost a large amount of blood. In all these 
cases the decrease in the quantity of the substance introduced (or, conversely, the 
restoration of an insufficient quantity) is subject to the law N = Noe~'/", where N 
is the amount (in other words, the number of molecules) of the substance remaining 
in the body after time t has elapsed from the introduction of the amount No and 
tT is the so-called lifetime: the time elapsed when 1/e of the quantity originally in- 
troduced remains in the body. The lifetime, as one can easily verify, is 1.44 times 
larger than the half-life, which is the time elapsed when half of the original quantity 
of the substance remains. 

Suppose a radioactive substance leaves the body at a rate characterized by the 
lifetime to, and at the same time decays spontaneously with lifetime tg. Show that 
in this case the lifetime t characterizing the time the substance remains in the body 
is determined by the relation t~! = T) ee Ta. Hs 

d) A certain quantity of blood containing 201 mg of iron has been taken from a 
donor. To make up for this loss of iron, the donor was ordered to take iron sulfate 
tablets three times a day for a week, each tablet containing 67 mg of iron. The 
amount of iron in the donor’s blood returns to normal according to an exponential 
law with lifetime equal to approximately seven days. Assuming that the iron from 
the tablets enters the bloodstream most rapidly immediately after the blood is taken, 
determine approximately the portion of the iron in the tablets that will enter the 
blood over the time needed to restore the normal iron content in the blood. 

e) A certain quantity of radioactive phosphorus P?* was administered to diag- 
nose a patient with a malignant tumor, after which the radioactivity of the skin 
of the thigh was measured at regular time intervals. The decrease in radioactivity 
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was subject to an exponential law. Since the half-life of phosphorus is known to be 
14.3 days, it was possible to use the data thus obtained to determine the lifetime 
for the process of decreasing radioactivity as a result of biological causes. Find this 
constant if it has been established by observation that the lifetime for the overall 
decrease in radioactivity was 9.4 days (see part c) above). 


4. Absorption of radiation. The passage of radiation through a medium is accom- 
panied by partial absorption of the radiation. In many cases (the linear theory) one 
can assume that the absorption in passing through a layer two units thick is the same 
as the absorption in successively passing through two layers, each one unit thick. 


a) Show that under this condition the absorption of radiation is subject to the 
law I = Ipe~*, where Jo is the intensity of the radiation falling on the absorbing 
substance, / is the intensity after passing through a layer of thickness /, and k is a 
coefficient having the physical dimension inverse to length. 

b) In the case of absorption of light by water, the coefficient k depends on the 
wave length of the incident light, for example as follows: for ultraviolet k = 1.4 x 
10-2 cm7!; for blue k = 4.6 x 1074 cm7!; for green k = 4.4 x 10-4 cm7!; for red 
k =2.9 x 1073 cm7!. Sunlight is falling vertically on the surface of a pure lake 10 
meters deep. Compare the intensities of these components of sunlight listed above 
the surface of the lake and at the bottom. 


5. Show that if the law of motion of a point x = x(t) satisfies the equation mx + 
kx = 0 for harmonic oscillations, then 


cw 2 
a) the quantity E = me) + mo is constant (E = K + U is the sum of the 


oo: 2 
kinetic energy K = mito) of the point and its potential energy U = i at time f); 


b) if x(0) = 0 and x(0) = 0, then x(t) =0; 

c) there exists a unique motion x = x(t) with initial conditions x(0) = xo and 
X(0) = v9; 

d) Verify that if the point moves in a medium with friction and x = x(f) satisfies 
the equation mx + ax +kx =0, a > 0, then the quantity E (see part a)) decreases. 
Find its rate of decrease and explain the physical meaning of the result, taking ac- 
count of the physical meaning of E. 


6. Motion under the action of a Hooke** central force (the plane oscillator). 

To develop Eq. (5.156) for a linear oscillator in Sect. 5.6.6 and in Problem 5 let us 
consider the equation mr(t) = —kr(t) satisfied by the radius-vector r(t) of a point 
of mass m moving in space under the attraction of a centripetal force proportional to 
the distance |r(t)| from the center with constant of proportionality (modulus) k > 0. 
Such a force arises if the point is joined to the center by a Hooke elastic connection, 
for example, a spring with constant k. 


34R. Hooke (1635-1703) — British scientist, a versatile scholar and experimenter. He discovered 
the cell structure of tissues and introduced the word cell. He was one of the founders of the math- 
ematical theory of elasticity and the wave theory of light; he stated the hypothesis of gravitation 
and the inverse-square law for gravitational interaction. 
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a) By differentiating the vector product r(t) x r(t), show that the motion takes 
place in the plane passing through the center and containing the initial position 
vector ro = r(to) and the initial velocity vector rg = r(tg) (a plane oscillator). If 
the vectors ro = r(to) and rp = r(fto) are collinear, the motion takes place along 
the line containing the center and the vector ro (the linear oscillator considered in 
Sect. 5.6.6). 

b) Verify that the orbit of a plane oscillator is an ellipse and that the motion is 
periodic. Find the period of revolution. 

c) Show that the quantity E = mr? (t) + kr?(t) is conserved (constant in time). 

d) Show that the initial data ro = r(fo) and ro = r(fo) completely determine the 
subsequent motion of the point. 


7. Ellipticity of planetary orbits. The preceding problem makes it possible to regard 
the motion of a point under the action of a central Hooke force as taking place in 
a plane. Suppose this plane is the plane of the complex variable z = x + iy. The 
motion is determined by two real-valued functions x = x(t), y = y(t) or, what is the 
same, by one complex-valued function z = z(t) of time t. Assuming for simplicity 
in Problem 6 that m = | and k = 1, consider the simplest form of the equation of 
such motion Z(t) = —z(ft). 


a) Knowing from Problem 6 that the solution of this equation corresponding to 
the specific initial data zo = z(to), Zo = Z(fo) is unique, find it in the form z(t) = 
cye’ + coe"! and, using Euler’s formula, verify once again that the trajectory of 
motion is an ellipse with center at zero. (In certain cases it may become a circle or 
degenerate into a line segment — determine when.) 

b) Taking account of the invariance of the quantity |z(t)|? + \z(t)|? during the 
motion of a point z(t) subject to the equation z(t) = —z(t), verify that, in terms of 
a new (time) parameter t connected with f by a relation tT = T(t) such that ee = 
|z(t)|?, the point w(t) = z(t) moves subject to the equation ew = ere where 
c is aconstant and w = w(t(t)). Thus motion in a central Hooke force field and 
motion in a Newtonian gravitational field turn out to be connected. 

c) Compare this with the result of Problem 8 of Sect. 5.5 and prove that planetary 
orbits are ellipses. 

d) If you have access to a computer, looking again at Euler’s method, explained 
in Sect. 5.6.5, first compute several values of e* using this method. (Observe that 
this method uses nothing except the definition of the differential, more precisely the 
formula f (x,) © f (xp—-1) + f’ (&p_h, where h = xy — Xn—1.) 

Now let r(t) = (x(t), y(t), ro = r(O) = C1, 0), ro = r(O) = (0, 1) and r(t) = 

—~ ;. Using the formulas 


V(t) © V(tn—-1) + V(tn-Vh, 
V(tn) © Vtn-1) + a(tr_-)h, 


where v(t) = r(t), a(t) = V(t) = F(t), use Euler’s method to compute the trajectory 
of the point. Observe its shape and how it is traversed by a point as time passes. 
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5.7 Primitives 


In differential calculus, as we have verified on the examples of the previous section, 
in addition to knowing how to differentiate functions and write relations between 
their derivatives, it is also very valuable to know how to find functions from rela- 
tions satisfied by their derivatives. The simplest such problem, but, as will be seen 
below, a very important one, is the problem of finding a function F(x) knowing its 
derivative F’(x) = f(x). The present section is devoted to an introductory discus- 
sion of that problem. 


5.7.1 The Primitive and the Indefinite Integral 


Definition 1 A function F(x) is a primitive of a function f(x) on an interval if F 
is differentiable on the interval and satisfies the equation F’(x) = f(x), or, what is 
the same, dF (x) = f(x) dx. 


1 


Example I The function F(x) = arctan x is a primitive of f(x) = is 


1 
1+x2° 


—,; on the entire 


real line, since arctan’ x = 


Example 2 The function F(x) = arccot + — is a primitive of f(x) = u Toa? on the set 


of positive real numbers and on the set of negative real numbers, since Yor x #0 


wie. t a oe 
aad acral a) =e. 


What is the situation in regard to the existence of a primitive, and what is the set 
of primitives of a given function? 

In the integral calculus we shall prove the fundamental fact that every function 
that is continuous on an interval has a primitive on that interval. 

We present this fact for the reader’s information, but in the present section we 
shall essentially use only the following characteristic of the set of primitives of a 
given function on an interval, already known to us (see Sect. 5.3.1) from Lagrange’s 
theorem. 


Proposition 1 [f F\(x) and F(x) are two primitives of f (x) on the same interval, 
then the difference F\(x) — F2(x)is constant on that interval. 


The hypothesis that F; and F2 are being compared on a connected interval is 
essential, as was pointed out in the proof of this proposition. One can also see this 
by comparing Examples | and 2, in which the derivatives of F; (x) = arctanx and 
F(x) = arccot i agree on the entire domain R\0 that they have in common. How- 
ever, 


1 
F\(x) — Fo(x) = arctan x — arccot — = arctanx — arctanx = 0, 
x 
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for x > O while F\(x) — Fo(x) = —7z for x < 0. For if x < 0, we have arccot + — 
mw + arctanx. 

Like the operation of taking the differential, which has the name “differentia- 
tion” and the mathematical notation dF (x) = F’(x) dx, the operation of finding a 
primitive has the name “indefinite integration” and the mathematical notation 


[ fea, (5.166) 


called the indefinite integral of f (x) on the given interval. 

Thus we shall interpret the expression (5.166) as a notation for any of the primi- 
tives of f on the interval in question. 

In the notation (5.166) the sign is called the indefinite integral sign, f is called 
the integrand, and f (x) dx is called a differential form. 

It follows from Proposition | that if F(x) is any particular primitive of f(x) on 
the interval, then on that interval 


/ f(x) dx = F(x) +C, (5.167) 


that is, any other primitive can be obtained from the particular primitive F(x) by 
adding a constant. 

If F’(x) = f(x), that is, F is a primitive of f on some interval, then by (5.167) 
we have 


af f (x) dx =dF(x) = F'(x) dx = f(x) de. (5.168) 


Moreover, in accordance with the concept of an indefinite integral as any primi- 
tive, it also follows from (5.167) that 


[orw=f reaareo+e. (5.169) 


Formulas (5.168) and (5.169) establish a reciprocity between the operations of 
differentiation and indefinite integration. These operations are mutually inverse up 
to the undetermined constant C that appears in (5.169). 

Up to this point we have discussed only the mathematical nature of the constant C 
in (5.167). We now give its physical meaning using a simple example. Suppose a 
point is moving along a line in such a way that its velocity u(t) is known as a 
function of time (for example, v(t) = v). If x(¢) is the coordinate of the point at 
time ft, the function x(t) satisfies the equation x(t) = v(t), that is, x(t) is a primitive 
of v(t). Can the position of a point on a line be recovered knowing its velocity 
over a certain time interval? Clearly not. From the velocity and the time interval 
one can determine the length s of the path traversed during this time, but not the 
position on the line. However, the position will also be completely determined if it 
is given at even one instant, for example, t = 0, that is, we give the initial condition 
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x(0) = xo. Until the initial condition is given, the law of motion could be any law 
of the form x(t) = x(t) +c, where x(t) is any particular primitive of v(t) and c is 
an arbitrary constant. But once the initial condition x(0) = x(0) + c = xo is given, 
all the indeterminacy disappears; for we must have x(0) = x(0) +c = xo, that is, 
c= Xo — X(0) and x(t) = x9 + [x(1) — x(0)]. This last formula is entirely physical, 
since the arbitrary primitive x appears in it only as the difference that determines 
the path traversed or the magnitude of the displacement from the known initial point 
x(0) = xo. 


5.7.2 The Basic General Methods of Finding a Primitive 


In accordance with the definition of the expression (5.166) for the indefinite inte- 
gral, this expression denotes a function whose derivative is the integrand. From this 
definition, taking account of (5.167) and the laws of differentiation, one can assert 
that the following relations hold: 


a. [lowe + poon)ax=a fwd +6 f vende +e, (5.170) 


b. osyar= fu'covayar+ [ucyr'ayar te. (5.171) 
c. If 


[ fears Fe +e 


onan interval I, and p : I; > I, is a smooth (continuously differentiable) mapping 
of the interval I, into I, then 


[room @ma= (Fon +e (5.172) 


The equalities (5.170), (5.171), and (5.172) can be verified by differentiating the 
left- and right-hand sides using the linearity of differentiation in (5.170), the rule 
for differentiating a product in (5.171), and the rule for differentiating a composite 
function in (5.172). 

Just like the rules for differentiation, which make it possible to differentiate lin- 
ear combinations, products, and compositions of known functions, relations (5.170), 
(5.171), and (5.172), as we shall see, make it possible in many cases to reduce the 
search for a primitive of a function either to the construction of primitives for sim- 
pler functions or to primitives that are already known. A set of such known primi- 
tives can be provided, for example, by the following short table of indefinite inte- 
grals, obtained by rewriting the table of derivatives of the basic elementary functions 
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(see Sect. 5.2.3): 


[oro — x*tlie (a#-1), 


J 
i 


dx =In|x| +c, 


Sle 


a dx = a +c (0<aFl), 
[eaaerte, 
[sinxdx = cos + 


[ross dx =sinx +c, 


1 
lane =tanx+c, 
COs* x 


1 
ant dx =—cotx+c, 


sin” x 


arcsinx +c, 
—arccosx + ¢, 


lz V1— oie 
| 1 foes 
—{ dx = 
14+ x2 —arccotx +c, 
i sinhx dx = coshx + c, 
[costs dx =sinhx+c, 
1 
l= = tanhx +c, 
cosh* x 
1 
/ dx = —cothx +c, 


sinh? x 
| eRerr hk v9 + 1| +e, 


1 1 1 
/ des il 
1—x2 2 1l-x 


Each of these formulas is used on the intervals of the real line R on which the cor- 
responding integrand is defined. If more than one such interval exists, the constant 
c on the right-hand side may change from one interval to another. 


+. 
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Let us now consider some examples that show relations (5.170), (5.171) and 
(5.172) in action. We begin with a preliminary remark. 

Given that, once a primitive has been found for a given function on an interval 
the other primitives can be found by adding constants, we shall agree to save writing 
below by adding the arbitrary constant only to the final result, which is a particular 
primitive of the given function. 


a. Linearity of the Indefinite Integral 


This heading means that by relation (5.170) the primitive of a linear combination 
of functions can be found as the same linear combination of the primitives of the 
functions. 


Example 3 
[lobar + tays”) de = 
= ay f tartar f xdx+ stay f x" ax = 
panel ant ew aa 
2. n+1" , 
Example 4 
i‘ = “4 / 242V/F+~)a 
Xx —— x = Xx Xx _ xt = 
afk x 
1 
= [Porta fx!Par+ f -ax= 
x 
1 4 
= art ge? +inial+e. 
Example 5 
/ 27 4 [oat )d 5 fat )d 
cos’ —~dx = | = cosx) dx = = cosx) dx = 
2 2 2 
5 f tacts | ae 
eae = COs = — sin - 
5 a+ 5 x ae tz sing te 
b. Integration by Parts 


Formula (5.171) can be rewritten as 


u(xyw(x) = f uydv(a) + f ve) du(x) +e 
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or, what is the same, as 


J wen dve) =ucnv(a) ~ f ve duix) +e. (5.171') 


This means that in seeking a primitive for the function u(x)v’(x) one can reduce 
the problem to finding a primitive for v(x)u'(x), throwing the differentiation onto 
the other factor and partially integrating the function, as shown in (5.171’), sepa- 
rating the term u(x)u(x) when doing so. Formula (5.171’) is called the formula for 
integration by parts. 


1 
forxde=axinx— fxdinx=xinx— f x-<ar= 
x 


=xinx— f lde= xine x0. 


Example 6 


Example 7 


jee dx = {2 de* = x*e* — Ga dx? = x7e* — 2 f xe" dx = 
= x7e* — 2 f de’ = x*e* — 2( xe — iG ax) = 


= x7e* —2xe* +2e% +c= (a> — 2x + 2)e* +c. 


c. Change of Variable in an Indefinite Integral 


Formula (5.172) shows that in seeking a primitive for the function (f 0 g)(t) - ’(t) 
one may proceed as follows: 


[rome -p'(t) dt = | Few) dg(t)= 


= / f(x)dx = F(x) +e = F(p) +6, 


that is, first make the change of variable g(t) = x in the integrand and pass to the 
new variable x, then, after finding the primitive as a function of x, return to the old 
variable t by the substitution x = g(r). 


Example 8 


t dt Laer). 1fde 1 iw 
= = = —_—| = —In(t 1 ; 
lo Al 142 Al Pate ae Ge ae a 
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Example 9 


‘i dx | dx | d(5) 
A = XxX x x Tx 
sin x 2 sin 5 cos 5 tan 5 cos? 5 
i. du / d(tan u) du 
~ J tanucos2u tanu oJ ov 


+c. 


=In|v| +c =In|tanu|+c=In 


x 
tan — 
2 


We have now considered several examples in which properties a, b, and c of the 
indefinite integral have been used individually. Actually, in the majority of cases, 
these properties are used together. 


Example 10 


1 
[snr cos 3x dx = 5 [oinss — sinx)dx = 


1 
— 5( f sinseax— f sinxax) = 
=e | in eydGn eae) < 
a) 5 Sin OX x COSxX |} = 


1 1 1 1 
= yf sinudu + 5 osx = — TF cosu + 5 cosx +e = 


1 1 
= =cosx — io 


2 
Example 11 
/ arcsin x dx = x arcsinx — [: darcsinx = 


1 fdd—x?) | 
2) JT—x2 


1 
= x arcsinx + 5 fw? du = x aresins + ul? + 0= 


3 x . 
= xaresin — f/ dx = xaresin s+ 
V1—x2 

=xaresinx + V1—x2+c. 


Example 12 


1 
1 cos bx dx = ~ [ 0s bx de = 
a 
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1 1 
—e™ cosbx — — / e™ dcosbx = 
a a 


1 b 
—e™ cos bx + — / e™ sinbx dx = 
a a 


1 b 
—e™ cos bx + = / sinbx de = 
a a 


1 b b 
= —e™ cos bx + ce sinbx — = / e™ dsinbx = 
a a a 


cosbx + bsinb b? 
at * 5 “ 5 | ef cosbx dx. 
a a 


From this result we conclude that 


acosbx + bsinbx 


es cos bx dx = Tp e* +e 
a 


We could have arrived at this result by using Euler’s formula and the fact that the 
primitive of the function e@+!”)* = e% cos bx + ie” sin bx is 


i e(atib)x _ a—ib elatib)x _ 
a+ib a2 + b2 
acosbx + bsinbx di, SINR BOSD: oe 
= e i ——_.—__.——_ 
a+b? a* + b? 


It will be useful to keep this in mind in the future. For real values of x this 
can easily be verified directly by differentiating the real and imaginary parts of the 
function a peletils : 

In particular, we also find from this result that 


sinbx — bcosb 
fe Snbxde = — < rw * ett +c. 


Even the small set of examples we have considered suffices to show that in seek- 
ing primitives for even the elementary functions one is often obliged to resort to 
auxiliary transformations and clever devices, which was not at all the case in find- 
ing the derivatives of compositions of the functions whose derivatives we knew. It 
turns out that this difficulty is not accidental. For example, in contrast to differen- 
tiation, finding the primitive of an elementary function may lead to a function that 
is no longer a composition of elementary functions. For that reason, one should 
not conflate the phrase “finding a primitive” with the sometimes impossible task 
of “expressing the primitive of a given elementary function in terms of elementary 
functions”. In general, the class of elementary functions is a rather artificial object. 
There are very many special functions of importance in applications that have been 
studied and tabulated at least as well as, say sin. x or e*. 
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For example, the sine integral Six is the primitive [ sina dx of the function 2 
that tends to zero as x — 0. There exists such a primitive, but, like all the other 
primitives o sin | it is not a composition of elementary functions. 


Similarly, the function 


. cos x 
cix= f dx, 
x 


specified by the condition Cix — 0 as x > oo is not elementary. The function Cix 
is called the cosine He a 

The primitive f< of the function = is also not elementary. One of the primi- 
tives of this fiacton: is denoted lix and is called the logarithmic integral. It satisfies 
the condition lix — 0 as x > +0. (More details about the functions Six, Cix, and 
lix will be given in Sect. 6.5.) 

Because of these difficulties in finding primitives, rather extensive tables of in- 
definite integrals have been compiled. However, in order to use these tables success- 
fully and avoid having to resort to them when the problem is very simple, one must 
acquire some skill in dealing with indefinite integrals. 

The remainder of this section is devoted to integrating some special classes of 
functions whose primitives can be expressed as compositions of elementary func- 
tions. 


5.7.3 Primitives of Rational Functions 


PO) 
Q(x) 


Let us consider the problem of integrating ‘i R(x) dx, where R(x) = is a ratio 
of polynomials. 

If we work in the domain of real numbers, then, without going outside this do- 
main, we can express every such fraction, as we know from algebra (see formula 


(5.135) in Sect. 5.5.4) as a sum 


P(x) <A bjax +e jk 
— = — ], 65.173 
Ou)?“ m2(Sa ate) (S| o— 


j=l \k=1 


where p(x) is a polynomial (which arises when P(x) is divided by Q(x), but only 
when the degree of P(x) is not less than the degree of Q(x)), ajx, Djx, and c jx are 
uniquely determined real numbers, and Q(x) = (x — xy)R eee (= xy) (x? + Pixt+ 
qt (xe + Pax +n)". 

We have already discussed how to find the expansion (5.173) in Sect. 5.5. Once 
the expansion (5.173) has been constructed, integrating R(x) reduces to integrating 
the individual terms. 

We have already integrated a polynomial in Example |, so that it remains only to 
consider the integration of fractions of the forms 


1 bx +c 


—— d —————_., here k EN. 
G—we an Cerra: where 
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The first of these problems can be solved immediately, since 


1 (x —a)y 146 fork #1, 
fer =EHT ~ 4) . 7 (5.174) 
(x —a) In|jx —al+c fork=1. 
With the integral 
bx+c 
(x? G2+pxtge 


we proceed as follows. We represent the polynomial x* + px +g as (x + 5 py) + 
(q- i p~), where q — i p* > 0, since the polynomial x? + px +q has no real roots. 


Setting x + SP =u and q — iP’ =a’, we obtain 
bx +c au+ Bp 
dx —— = du, 
(x2 + px t+ q)k (u2 + a2)k 


where a = b and B =c — 5bp. 


Next, 
| u p 1 f du? +a?) 
u= = 
(u2 + a)k 2 (u2 + a2)k 


sap? ta?! fork #1, 
~ | din? +a’) fork =1, 


(5.175) 


and it remains only to study the integral 


du 
i= | oe (5.176) 


Integrating by parts and making elementary transformations, we have 


uz 
du 
n= fay cae aR apa 
Uu (u2 +a’) — a u 2 
= Geaaee (2 + a2) u= (u2 + a2) + 2kIk — 2ka Tk, 


from which we obtain the recursion relation 


1 u i 2k—-1 
2ka2 (u2 + a2)k 2ka2 


lei = lig (5.177) 


which makes it possible to lower the exponent k in the integral (5.176). But J; is 
easy to compute: 


I | ee 1 Ne) 2258 gota & (5.178) 
= — => arc an = . 
; uta? aj 1+(#? a - 
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Thus, by using (5.177) and (5.178), one can also compute the primitive (5.176). 
Thus we have proved the following proposition. 

Proposition 2 The primitive of any rational function R(x) = ae can be expressed 

in terms of rational functions and the transcendental functions \n and arctan. The 

rational part of the primitive, when placed over a common denominator, will have 

a denominator containing all the factors of the polynomial Q(x) with multiplicities 

one less than they have in Q(x). 


2x2 +5x +5 


Example 13 Let us calculate | —.—————— dx. 
(x2 — 1)(x +2) 


Since the integrand is a proper fraction, and the factorization of the denominator 
into the product (x — 1)(x + 1)(x +2) is also known, we immediately seek a partial 
fraction expansion 


Ss ge ae 
(x—DatDa~t2) x-1 x+1 x42 


(5.179) 


Putting the right-hand side of Eq. (5.179) over a common denominator,we have 


2x? +5x+5 _ (A+ B4+C)x* + GA+t B)x + (24 -2B-C) 
(x —D@+1)@+2) ~ (x — Dat Dat+2) 


Equating the corresponding coefficients in the numerators, we obtain the system 


A+B+C=2, 
3A+B=5, 
2A—2B—C=5, 


from which we find (A, B, C) = (2, —1, 1). 

We remark that in this case these numbers could have been found in one’s head. 
Indeed, multiplying (5.179) by x — 1 and then setting x = 1 in the resulting equality, 
we would have A on the right-hand side, while the left-hand side would have been 
the value at x = | of the fraction obtained by striking out the factor x — | in the 
denominator, that is, A = alate = 2. One could proceed similarly to find B and C. 


Thus, 
oy 4 SS dx dx dx 
gy =2 4 = 
(x2 — 1)(x +2) x-—1 x+1 x+2 
= 2In|x —1|—In|x +1] +In|x +2|+c 
(x — 1)?(x +2) 


x-—1 


=In 


5.7 Primitives 317 
Example 14 Let us compute a primitive of the function 


x? — 2x84 4x5 — 5x44 4x3 — 5x2 -— x 
(x — 1)?7(@? + 1)? 


R(Qx)= 


We begin by remarking that this is an improper fraction, so that, removing the 
parentheses and finding the denominator Q(x) = x® — 2x> + 3x4 — 4x3 4+ 3x? — 
2x + 1, we divide the numerator by it, after which we obtain 


we — x44 x3 — 3x2 -— 2x 
(x — 1)?(x? + 1)? 


R(x) =xt+ 


’ 


and we then seek a partial-fraction expansion of the proper fraction 


xo — x44 x3 — 3x2 —2x A B Cx+D Ex+F 
2(y2 2 — z+ +735 aw) - (5.180) 
(x — 1)*(x2+ 1) (x — 1) x-1l  (**+1) x?7+1 


Of course the expansion could be obtained in the canonical way, by writing out 
a system of six equations in six unknowns. However, instead of doing that, we shall 
demonstrate some other technical possibilities that are sometimes used. 

We find the coefficient A by multiplying Eq. (5.180) by (x — 1)? and then setting 
x = 1. The result is A = —1. We then transpose the fraction a “a , in which A is 
now the known quantity —1, to the left-hand side of Eq. (5.180). We then have 


xAt xP 42x27 4x-1 _ B jp ED 5 Pee 
(x—Da24+)2  — x-1) (e241?) x? 41 


(5.181) 


from which, multiplying (5.181) by x — | and then setting x = 1, we find B = 1. 
Now, transposing the fraction 4 to the left-hand side of (5.181), we obtain 


x7 tx4+2_ Cx+D ge ec Te 
(x2 +1)? 7 (x21)? © x? 17 


(5.182) 


Now, after putting the right-hand side of (5.182) over a common denominator, 
we equate the numerators 


xe 4+x4+2=5 Ex? + Fx? +(C+E)x+(D+F), 


from which it follows that 


E=0, 
F=1, 
C+E=1, 
D+F=2, 


or (C, D, E, F) = (1, 1,0, 1). 
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We now know all the coefficients in (5.180). Upon integration, the first two frac- 
tions yield respectively <4 and In|x — 1|. Then 


Cx+D x+1 
x = x= 
Ge? “ a 
ae +1) 


1)" eee 1? ~ 262 a 


where 


1 x 1 
— = t; , 
2 / Ge cE 12 5 @ " D2 + ) arctan x 


which follows from (5.177) and (5.178). 


Finally, 
Ex+F 1 
dx = | ——~— dx = arctanx. 
x2+4+1 x2+1 


Gathering all the integrals, we finally have 


1 a Xx 
—1 2(x%?4+1) 


1 3 
[Roa = =x* + +In|x — 1] +  arctanx +c. 
2 x 2 2; 
Let us now consider some frequently encountered indefinite integrals whose 
computation can be reduced to finding the primitive of a rational function. 


5.7.4 Primitives of the Form { R(cos x, sinx) dx 


Let R(u, v) be a rational function in uw and v, that is a quotient of polynomials 
aa which are linear combinations of monomials uv", where m = 0, 1,2... 
andn=0,1,.... 

Several methods exist for computing the integral f R(cosx, sinx)dx, one of 


which is completely general, although not always the most efficient. 


a. 


We make the change of variable t = tan 5. Since 


1 — tan? a : 2 tans 
cosx = ———_=, sinx = ——_s_., 
1+ tan2 5 1+ tan* 5 

2dt 


= —,, thatis, dx = ——_, 
2 cos? s 1+ tan? 5 
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it follows that 


[x inx)d [R [= 2 dt 
cos x, sinx) dx = ‘ ; 
1477? 1472/1422 


and the problem has been reduced to integrating a rational function. 

However, this way leads to a very cumbersome rational function; for that reason 
one should keep in mind that in many cases there are other possibilities for rational- 
izing the integral. 


b. 


In the case of integrals of the form f R(cos? x, sin? x) dx or f r(tanx) dx, where 
r(u) is arational function, a convenient substitution is tf = tanx, since 


7) 1 a) tan? x 
cos* x = ————_, sin’ x = ———_., 
1+ tan? x 1+ tan? x 
dx ; dt 
dt = , thatis, dx=——,. 
cos? x 1+72 


Carrying out this substitution, we obtain respectively 


J R(c08?x,sin?x)ax = fr : : v Ma : 
147°14+7/14+?7 


[roan )d. [ro @ 
r x)dx = | r(t)——. 
1+? 


c. 
In the case of integrals of the form 
i R(cosx, sin’ x) sinxdx or / R(cos* x, sin x) cos x dx. 
One can move the functions sinx and cos x into the differential and make the sub- 


stitution tf = cosx or t = sinx respectively. After these substitutions, the integrals 
will have the form 


— f R@1-P)ar or [Rae nar 


Example 15 


if dx ai 1 2 dt 
ap Tn Qi 
3+ sinx aa et) 1+t 
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=2/ dt =? dt + 4) =2 du 
oe Es ae 


+c= 


3u 1 
arctan —— + c = —~ arctan 


2 aD v2 2/2 


tan 5+ 1 
= — = arctan —————__ +c. 


V2 2/2 


Here we have used the universal change of variable t = tan 5 


Example 16 


dx 
/ (sinx +cosx)? Y cos? x(tanx + 1)? 


dtanx -[< 1 de 
C= 
a 1)2 (t+ D2 t+1 
A +tanx’ 


Example 17 


dx _ dx 2 
a a =| Sow is 
1 dtan 3x 1 dt 
santor- inkl 


= 8y Saee 32-1 


1 / du u—1 di 
= C= 
3/6 J u2— u+l1 
ni 
(peel 
1 tmx 3 
= lt 
Be” 
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Example 18 
/ cos? x d / cos? x dsinx -{— d- eS dr 
x = 
sin’ x sin’ x 


1 1 1 
-7_ 4-5 6 4 
= t —t dt = — i + -t +c= +c. 
/ ( ) 6 4 Asin¢x  6sin®°x 


5.7.5 Primitives of the Form { R(x, y(x)) dx 


Let R(x, y) be, as in Sect. 5.7.4, a rational function. Let us consider some special 


integrals of the form 
/ R(x, y(x)) dx 


where y = y(x) is a function of x. 

First of all, it is clear that if one can make a change of variable x = x(t) such 
that both functions x = x(t) and y = y(x(f)) are rational functions of t, then x’(t) 
is also a rational function and 


[ 8.x) dx = [ Ro. reO)"0 dt, 


that is, the problem will have been reduced to integrating a rational function. 
Consider the following special choices of the function y = y(x). 


a. 
— nf ax+b n_ — axt+b * 
Ify= V5 Ta? where n €N, then, setting t org We obtain 
d 5 t” = 
{= a 
a-—c:-t" 4 


and the integrand rationalizes. 


Example 19 
= P+ P+ e+ 
IV: dv= fra ea ey ee i oe i= 
x+1 1-23 1-23 1-4 
e+ 2 
=t- 1)dt= 
1-4 Ce: ) 


P+ dt 
oe ee = 
1-4 (d—-nd+1r+2?) 


322 5 Differential Calculus 


_ 2t 2f 1 e yee Hee 
~ 1-7 a1—7) 30474 2)7 °° 


2f@¢+h+3 
Infl—1|—3 2) = dt = 
(t+ 5) +G 


2 
3 
zs ea t| Bi pat a 
= n n = =) — 
—t 3 2 


3 

2 ; 2 (:+3)+ one sei 
— — = arctan — = Cc, where f = — a 
V3 V3 2 sae ae 


b. 


Let us now consider the case when y = Vax? + bx +c, that is, integrals of the form 
if R(x, Vax? + bx + c) dx. 


By completing the square in the trinomial ax? + bx +c and making a suitable linear 
substitution, we reduce the general case to one of the following three simple cases: 


[Reverie [Re 1? — 1)dt, [Revie (5.183) 


To rationalize these integrals it now suffices to make the following substitutions, 
respectively: 


?+1=tut+l, or Vt2+1l=tu—-1, or Vt+1=t-u:; 
vt?-l=u(t—1), or Vt?-1l=u(t4+1), or Vt? —-1l=t—u:; 
vVl—#=u(l—t), or V1l—t#=u(1+4+t), or 1-12 =tut1. 


These substitutions were proposed long ago by Euler (see Problem 3 at the end 


of this section). 

Let us verify, for example, that after the first substitution we will have reduced 
the first integral to the integral of a rational function. 

In fact, if Jt? + 1 = tu + 1, then ? + 1 = 17u? + 2tu + 1, from which we find 


and then 
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Thus ¢ and \/t2 + 1 have been expressed rationally in terms of u, and consequently 
the integral has been reduced to the integral of a rational function. 


The integrals (5.183) can also be reduced, by means of the substitutions t = 
sinh gy, t = cosh@, and t = sing (or t = cos @) respectively, to the following forms: 


| R(sinhg, cosh@) cosh¢ dg, p R(coshg, sinh @) sinh g dg 
and 


[ Roving. cosp)cosedy or ~ | R(cose,sing) sing dg. 


Example 20 


/ dx =) dx = dt 
xtvVx2 42x42 $e DEI 1/2 FL 


Setting Vt? + 1 =u —t, we have 1 = u2 — 2tu, from which it follows that t = 
u2—1 
2u 


. Therefore 
/ dt | 1 i+ 1 4 | 1 dice 
a u= u 
t-1l4+VP41 2/ u-1 u2 2J u-il 


i | du 
2J u2(u—1) 


Lo eet 1 i 1\4 
= niu u= 
2 2 u-—1 uz sou 
lect oa Neale 
= n n . 
aa 2 Qu 


It now remains to retrace the path of substitutions: u=1+</t? + landt=x+1. 


c. Elliptic Integrals 


Another important class of integrals consists of those of the form 


[ R.VPo) dx, (5.184) 


where P(x) is a polynomial of degree n > 2. As Abel and Liouville showed, such 
an integral cannot in general be expressed in terms of elementary functions. 

For n = 3 and n = 4 the integral (5.184) is called an elliptic integral, and for 
n > 4 itis called hyperelliptic. 

It can be shown that by elementary substitutions the general elliptic integral can 
be reduced to the following three standard forms up to terms expressible in elemen- 
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tary functions: 


/ o (5.185) 
VU = x2) = kx?) 
x de (5.186) 
J = x2) = x2) 
/ i (5.187) 
(1+ hx2)/ (1 — x2)(1 — k2x?) 


where h and k are parameters, the parameter k lying in the interval ]0, 1[ in all three 
cases. 

By the substitution x = sing these integrals can be reduced to the following 
canonical integrals and combinations of them: 


dg 
a (5.188) 
1—k?sin* g 
fv — k* sin* gdg, (5.189) 
d 
i e (5.190) 
(1+hsin? 9)/1—k? sin? p 


The integrals (5.188), (5.189) and (5.190) are called respectively the elliptic in- 
tegral of first kind, second kind, and third kind (in the Legendre form). 

The symbols F(k, g) and E(k, g) respectively denote the particular elliptic in- 
tegrals (5.188) and (5.189) of first and second kind that satisfy F(k, 0) = 0 and 
E(k, 0) =0. 

The functions F(k, g) and E(k, g) are frequently used, and for that reason very 
detailed tables of their values have been compiled for0 <k < land0<g<z7/2. 

As Abel showed, it is natural to study elliptic integrals in the complex domain, in 
intimate connection with the so-called elliptic functions, which functions, which are 
related to the elliptic integrals exactly as the function sinx, for example, is related 
to the integral dv _ = arcsing. 


Vie 


5.7.6 Problems and Exercises 


1. Ostrogradskii’ s*° method of separating off the rational part of the integral of a 
proper rational fraction. 


35M.V. Ostrogradskii (1801-1861) — prominent Russian specialist in theoretical mechanics and 
mathematician, one of the founders of the applied area of research in the Petersburg mathematical 
school. 
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Let on = be a proper rational fraction, let g(x) be the polynomial having the same 


ath as ae: but with multiplicity 1, and let Q|(x) = rae 
Show that 


a) the following formula of Ostrogradskii holds: 


OD i win BOD a (5.191) 
Q(x) O1(x) q(x) 

where ae and Be 

function. 


(Because of this result, the fraction i 2 . in (5.191) is called the rational part of 


the integral f 4 o dx.) 
b) In the formula 


P(x) _ ( Pi(x) )+ P(x) 

Q(x) Q1(x) q(x) 
obtained by differentiating Ostrogradskii’s formula, the sum at the right hand side 
can be given the denominator Q(x) after suitable cancellations. 

c) The polynomials g(x), Q; (x), and then also the polynomials p(x), P,(x) can 
be found algebraically, without even knowing the roots of Q(x). Thus the rational 
part of the integral (5.191) can be found completely without even computing the 
whole primitive. 

d) Separate off the rational part of the integral (5.191) if 


P(x) = 2x° + 3x° + 6x4 + 6x3 + 10x? + 3x +2, 
O(x) = x7 + 3x® + 50° + 7x4 + 7x3 + 5x2 43x41 
(see Example 17 in Sect. 5.5). 


2. Suppose we are seeking the primitive 
J (osx, sinayar, (5.192) 


P(u,v) 


Ouv) 18a rational function. 


where R(u, v) = 
Show that 


a) if R(—u, v) = R(u, v), then R(u, v) has the form Rj (u?, v); 

b) if R(—u, v) = —R(u, v), then R(u, v) =u - R2(u*, v) and the substitution 
t = sinx rationalizes the integral (5.192); 

c) If R(—u, —v) = R(u, v), then R(u, v) = RC, v~), and the substitution t = 
tan x rationalizes the integral (5.192). 


3. Integrals of the form 


[ Ro. Var oreo) dx. (5.193) 
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a) Verify that the integral (5.193) can be reduced to the integral of a rational 
function by the following Euler substitutions: 


t=VJVax?+bx+ct Jax, ifa>0, 


xX-X 
t= 1 
x-Xx2 


if x2 and x2 are real roots of the trinomial ax? + bx +c. 


b) Let (xo, yo) be a point of the curve y* = ax? + bx + c and ¢ the slope of the 
line passing through (xo, yo) and intersecting this curve in the point (x, y). Express 
the coordinates (x, y) in terms of (xo, yo) and ¢ and connect these formulas with 
Euler’s substitutions. 

c) Acurve defined by an algebraic equation P(x, y) = 0 is unicursal if it admits 
a parametric description x = x(t), y = y(t) in terms of rational functions x(t) and 
y(t). Show that the integral ‘i R(x, y(x)) dx, where R(u, v) is a rational function 
and y(x) is an algebraic function satisfying the equation P(x, y) = 0 that defines 
the unicursal curve, can be reduced to the integral of a rational function. 

d) Show that the integral (5.193) can always be reduced to computing integrals 
of the following three types: 


/ P(x) ax | dx 

Jax? + bx +e (x —x0)k. Jax? +bx +e 
(Ax + B) dx 

/ (x2-+ px +a)™- Jax? +bx+e 


4. a) Show that the integral 


pe (a + ba")? dx 


whose differential is a binomial, where m, n, and p are rational numbers, can be 
reduced to the integral 


Jo + bt)? 14 dt, (5.194) 


where p and gq are rational numbers. 

b) The integral (5.194) can be expressed in terms of elementary functions if one 
of the three numbers p, g, and p+ q is an integer. (Chebyshev showed that there 
were no other cases in which the integral (5.194) could be expressed in elementary 
functions.) 


5. Elliptic integrals. 


a) Any polynomial of degree three with real coefficients has a real root xo, and 
can be reduced to a polynomial of the form t?(at* + bt? + ct* + dt + e), where 
a £0, by the substitution x — x9 = 17. 

b) If R(w, v) is a rational function and P a polynomial of degree 3 or 4, the func- 
tion R(x, ./P(x)) can be reduced to the form Rj (t, Vat4 + bt? +---+ e), where 


a0. 
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c) A fourth-degree polynomial ax* + bx* + --- + e can be represented as a 


product a(x? + pix +41)(x* + p2x + q2) and can always be brought into the form 
(Mi +N 17)(M3+No1) at+p 
(yt+lP yr" 
d) A function R(x, /ax* + bx3 +--- +e) can be reduced to the form 


by a substitution x = 


(t Jali +mjt?)(1 + m2t*)) 


at+B 


by a substitution x = yitl" 


e) A function R(x, ,/y) can be represented as a sum Rj (x, y) + , where 


Ro, y) 
. vy 
R, and R>2 are rational functions. 

f) Any rational function can be represented as the sum of even and odd rational 
functions. 

g) If the rational function R(x) is even, it has the form r(x2); if odd, it has the 
form xr (x7), where r(x) is a rational function. 


h) Any function R(x, ,/y) can be reduced to the form 


Ro(x?,y)  R3(x?, y) 
+ Xs 


vy vy 


i) Up toasum of elementary terms, any integral of the form i R(x, / P(x)) dx, 
where P(x) is a polynomial of degree four, can be reduced to an integral 


Ri(x,y) + 


r(t?) dt 
VAC + mit2)(1 + mot?) 


where r(f) is a rational function and A = +1. 
jp if igs > |m2| > 0, one 7 the substitutions ./mjt = x, ./mjt = V1 —x?, 
r(t?) dt 


/mit = =, and ./myt = = = will reduce the integral 
V1 Te g J a/ A(L+myt2)(1+myt?) 


to the form I cr 
fA —x2)(1—K2x2) 


k) Derive a formula for lowering the exponents 2 and m for the integrals 


where 0 < k < | and; is a rational function. 


/ x?" dx / dx 
Jd=-2)0—-Rx)) J (x2 -ayn- J = 2) — 2x2) 
1) Any elliptic integral 


[ R. VP) ax 


where P is a fourth-degree polynomial, can be reduced to one of the canonical forms 
(5.185), (5.186), (5.187), up to a sum of terms consisting of elementary functions. 
m) Express the integral aa in terms of canonical elliptic integrals. 


n) Express the primitives of the functions in terms of 


1 1 
cos 2x and /Cos a—Cos x 
elliptic integrals. 
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6. Using the notation introduced below, find primitives of the following nonelemen- 
tary special functions, up to a linear function Ax + B; 


x 
a) Ei(x) = i S dx (the exponential integral); 
x 


. sin x ae 
b) Si(x) = / — dx (the sine integral); 
x 


p COs x bn a 
c) Ci(x) = i dx (the cosine integral); 
x 


sinh x 


d) Shi(x) = i dx (the hyperbolic sine integral); 
e) Chi(x) = / 
scx) = f sins? ax 


g) cox) = f cosx? ax 


coshx 


dx (the hyperbolic cosine integral); 
(the Fresnel integrals); 


h) @(x) = / en?” dx (the Euler—Poisson integral); 
<i) dx Ae ean. 
i) li(x) = it ie (the logarithmic integral). 

nx 


7. Verify that the following equalities hold, up to a constant: 
a) Ei(x) = li(x); 
b) Chi(x) = 5[Ei(x) + Ei(—x)]; 
c) Shi(x) = 5[Ei(x) — Ei(—x)]; 
d) Ei(@ix) = Ci(x) +7 Six); 
e) el /4G(xe7!7/4) = C(x) +i S(x). 


8. A differential equation of the form 


dy f(x) 


dx = g(y) 


is called an equation with variables separable, since it can be rewritten in the form 


g(y)dy = f(x) dx, 


in which the variables x and y are separated. Once this is done, the equation can be 
solved as 


[ema f rears. 


by computing the corresponding integrals. 
Solve the following equations: 
a) 2x3 yy’ + y? =2; 


b) xyy’ = V1 +x; 
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c) y’=cos(y + x), setting u(x) = y(x) + x; 

d) x*y’ —cos2y = 1, and exhibit the solution satisfying the condition y(x) > 0 
as x > +00. 

e) ty/(x) = Six); 

8) Sr = CQ). 
9. A parachutist has jumped from an altitude of 1.5 km and opened the parachute at 
an altitude of 0.5 km. For how long a time did he fall before opening the parachute? 
Assume the limiting velocity of fall for a human being in air of normal density is 
50 m/s. Solve this problem assuming that the air resistance is proportional to: 


a) the velocity; 
b) the square of the velocity. 
Neglect the variation of pressure with altitude. 


10. It is known that the velocity of outflow of water from a small aperture at the 
bottom of a vessel can be computed quite precisely from the formula 0.6./2gH, 
where g is the acceleration of gravity and H the height of the surface of the water 
above the aperture. 

A cylindrical vat is set upright and has an opening in its bottom. Half of the water 
from the full vat flows out in 5 minutes. How long will it take for all the water to 
flow out? 

11. What shape should a vessel be, given that it is to be a solid of revolution, in 
order for the surface of the water flowing out of the bottom to fall at a constant rate 
as water flows out its bottom? (For the initial data, see Exercise 10.) 

12. In a workshop with a capacity of 10* m? fans deliver 10° m? of fresh air per 
minute, containing 0.04 % COd, and the same amount of air is vented to the outside. 
At 9:00 AM the workers arrive and after half an hour, the content of CO2 in the air 
rises to 0.12 %. Evaluate the carbon dioxide content of the air by 2:00 PM. 


Chapter 6 
Integration 


6.1 Definition of the Integral and Description of the Set 
of Integrable Functions 


6.1.1 The Problem and Introductory Considerations 


Suppose a point is moving along the real line, with s(t) being its coordinate at time f 
and v(t) =s'(t) its velocity at the same instant t. Assume that we know the position 
S(tg) of the point at time fy and that we receive information on its velocity. Having 
this information, we wish to compute s(t) for any given value of time t > fo. 

If we assume that the velocity u(t) varies continuously, the displacement of 
the point over a small time interval can be computed approximately as the prod- 
uct v(t) At of the velocity at an arbitrary instant t belonging to that time interval 
and the magnitude Ar of the time interval itself. Taking this observation into ac- 
count, we partition the interval [fo, tf] by marking some times ¢; (i = 0, ..., 2) so that 
to <t) <--- <t, =t and so that the intervals [t;_ |, t;] are small. Let At; = t; — t;_ 
and 7; € [t;—-1, t;]. Then we have the approximate equality 


n 


s(t) — s(t) © 0 v(t) Ag. 


i=1 


According to our picture of the situation, this approximate equality will become 
more precise if we partition the closed interval [fg, t] into smaller and smaller inter- 
vals. Thus we must conclude that in the limit as the length 4 of the largest of these 
intervals tends to zero we shall obtain an exact equality 


fim, 2 v(t;) At; = s(t) — s(to). (6.1) 


This equality is none other than the Newton—Leibniz formula (fundamental the- 
orem of calculus), which is fundamental in all of analysis. It enables us on the one 
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Fig. 6.1 


0 Xi Vi4+1 1 HO 


hand to find a primitive s(t) numerically from its derivative v(t), and on the other 
hand to find the limit of sums )77_, v(t;) At; on the left-hand side from a primitive 
s(t) found by any means whatever. 

Such sums, called Riemann sums, are encountered in a wide variety of situations. 

Let us attempt, for example, following Archimedes, to find the area under the 
parabola y = x* above the closed interval [0, 1] (see Fig. 6.1). Without going into 
detail here as to the meaning of the area of a figure, which we shall take up later, 
like Archimedes, we shall work by the method of exhausting the figure with simple 
figures — rectangles, whose areas we know how to compute. After partitioning the 
closed interval [0, 1] by points 0 = x9 < x1 <--- <x, = | into tiny closed intervals 
[xi-1, xi], we can obviously compute the required area o as the sum of the areas of 
the rectangles shown in the figure: 


n 
on yeas 
i=l 


here Ax; = x; — x;_-1. Setting f(x) = x? and &; = x;_1, we rewrite the formula as 


ox) f(Ei)AX. 


i=l 


In this notation we have, in the limit, 
n 

lim, d, fE)Ax =o (6.2) 
i= 


where, as above, A is the length of the longest interval [x;_1, x;] in the partition. 
Formula (6.2) differs from (6.1) only in the notation. Forgetting for a moment the 
geometric meaning of f(&;) Ax; and regarding x as time and f(x) as velocity, we 
find a primitive F(x) for the function f(x) and then, by formula (6.1) we find that 
o =F(1)-—F(O). 
In our case f(x) = x’, so that F(x) = 5x +c,ando = F(1) — F(0)= i: 
This is Archimedes’ result, which he obtained by a direct computation of the 
limit in (6.2). 
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A limit of integral sums is called an integral. Thus the Newton—Leibniz for- 
mula (6.1) connects the integral with the primitive. 

We now turn to precise formulations and the verification of what was obtained 
above on the heuristic level from general considerations. 


6.1.2 Definition of the Riemann Integral 


a. Partitions 


Definition 1 A partition P of a closed interval [a, b], a < b, is a finite system of 
points xo, ...,X, of the interval such that a = xg < x1 <---<x,=b). 
The intervals [xj~-1, xi], @ =1,..., 7) are called the intervals of the partition P. 
The largest of the lengths of the intervals of the partition P, denoted A(P), is 
called the mesh of the partition. 


Definition 2 We speak of a partition with distinguished points (P, &) on the closed 
interval [a, b] if we have a partition P of [a, b] and a point &; € [x;_1, x;] has been 
chosen in each of the intervals of the partition [x;-1, xj] @=1,...,7). 

We denote the set of points (1, ..., &) by the single letter &. 


b. A Base in the Set of Partitions 


In the set P of partitions with distinguished points on a given interval [a, b], we 
consider the following base 6 = {Bz}. The element Bz, d > 0, of the base 6 consists 
of all partitions with distinguished points (P, €) on [a, b] for which A(P) < d. 

Let us verify that {Bz}, d > 0 is actually a base in P. 

First Bg # @. In fact, for any number d > 0, it is obvious that there exists a 
partition P of [a, b] with mesh A(P) < d (for example, a partition into n congruent 
closed intervals). But then there also exists a partition (P,&) with distinguished 
points for which A(P) <d. 

Second, if dj > 0, d2 > 0, and d = min{d}, d}, it is obvious that By, N Bg, = 
Ba cB. 

Hence B = {Bz} is indeed a base in P. 


c. Riemann Sums 


Definition 3 If a function f is defined on the closed interval [a, b] and (P,é) isa 
partition with distinguished points on this closed interval, the sum 


OPES > fGiAm (6.3) 


i=1 
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where Ax; = x; — xj-1, is the Riemann sum of the function f corresponding to the 
partition (P, €) with distinguished points on [a, b]. 

Thus, when the function f is fixed, the Riemann sum o(f; P,&) is a function 
@®(p)=oa(f; p) on the set P of all partitions p = (P, €) with distinguished points 
on the closed interval [a, b]. 

Since there is a base 6 in P, one can ask about the limit of the function }(p) 
over that base. 


d. The Riemann Integral 


Let f be a function defined on a closed interval [a, b]. 


Definition 4 The number / is the Riemann integral of the function f on the closed 
interval [a, b] if for every ¢ > O there exists 6 > 0 such that 


<é 


I—)>) fG)Ax 


i=l 


for any partition (P, €) with distinguished points on [a, b] whose mesh A(P) is less 
than 6. 

Since the partitions p = (P,é) for which A(P) < 6 form the element Bs of the 
base B introduced above in the set P of partitions with distinguished points, Defi- 
nition 4 is equivalent to the statement 


T=lim®@(p), 
in (p) 


that is, the integral / is the limit over B of the Riemann sums of the function f 
corresponding to partitions with distinguished points on [a, b]. 

It is natural to denote the base B by A(P) — 0, and then the definition of the 
integral can be rewritten as 


n 


T= lim So FG) Ax. (6.4) 


A(P)>04 
i=1 


The integral of f(x) over [a, b] is denoted 


b 
/ f(x) dx, 


in which the numbers a and b are called respectively the lower and upper limits of 
integration. The function f is called the integrand, f (x) dx is called the differential 
form, and x is the variable of integration. Thus 


b n 
/ f(x)dx:= lim ody f EA. (6.5) 
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Definition 5 A function f is Riemann integrable on the closed interval [a, b] if the 
limit of the Riemann sums in (6.5) exists as A(P) — 0 (that is, the Riemann integral 
of f is defined). 

The set of Riemann-integrable functions on a closed interval [a, b] will be de- 
noted R[a, b]. 

Since we shall not be considering any integrals except the Riemann integral for a 
while, we shall agree for the sake of brevity to say simply “integral” and “integrable 
function” instead of “Riemann integral” and “Riemann-integrable function”. 


6.1.3 The Set of Integrable Functions 


By the definition of the integral (Definition 4) and its reformulation in the forms 
(6.4) and (6.5), an integral is the limit of a certain special function ®(p) = 
o(f; P,&), the Riemann sum, defined on the set P of partitions p = (P,&) with 
distinguished points on [a, b]. This limit is taken with respect to the base B in P 
that we have denoted A(P) > 0. 

Thus the integrability or nonintegrability of a function f on [a,b] depends on 
the existence of this limit. 

By the Cauchy criterion, this limit exists if and only if for every e > 0 there exists 
an element Bs € 6 in the base such that 


|P(p') — B(p")| <e 


for any two points p’, p” in Bs. 
In more detailed notation, what has just been said means that for any ¢ > O there 
exists 6 > O such that 


Jo(fs 6) —o (fi PE") <e 


or, what is the same, 


n’ n’ 


D> FE) Ax — DO FE) Axi] <e (6.6) 


i=l i=l 


for any partitions (P’,&") and (P”,”) with distinguished points on the interval 
[a, b] with A(P’) < 6 and A(P”) <6. 

We shall use the Cauchy criterion just stated to find first a simple necessary con- 
dition, then a sufficient condition for Riemann integrability. 


a. A Necessary Condition for Integrability 


Proposition 1 A necessary condition for a function f defined on a closed interval 
[a, b] to be Riemann integrable on [a, b] is that f be bounded on [a, b]. 
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In short, 


(f € Ra, b]) => (f is bounded on [a, b]). 


Proof If f is not bounded on [a, b], then for any partition P of [a,b] the func- 
tion f is unbounded on at least one of the intervals [x;_1,x;] of P. This means 
that, by choosing the point & € [xj~-1,x;] in different ways, we can make the 
quantity | f(&;) Ax;| as large as desired. But then the Riemann sum o(/f; P,é) = 
yr f (&) Ax; can also be made as large as desired in absolute value by changing 
only the point &; in this interval. 

It is clear that there can be no possibility of a finite limit for the Riemann sums in 
such a case. That was in any case clear from the Cauchy criterion, since relation (6.6) 
cannot hold in that case, even for arbitrarily fine partitions. 


As we shall see, the necessary condition just obtained is far from being both 
necessary and sufficient for integrability. However, it does enable us to restrict con- 
sideration to bounded functions. 


b. A Sufficient Condition for Integrability and the Most Important Classes 
of Integrable Functions 


We begin with some notation and remarks that will be used in the explanation to 
follow. 
We agree that when a partition P 


a=xj <x1 <°::<x,=b 


is given on the interval [a,b], we shall use the symbol A; to denote the interval 
[xj-1, x;] along with Ax; as a notation for the difference x; — x;_-1. 

If a partition P of the closed interval [a, b] is obtained from the partition P by 
the adjunction of new points to P, we call Pa refinement of P. 

When a refinement P of a partition P is constructed, some (perhaps all) of the 
closed intervals A; = [x;—, x;] of the partition P themselves undergo partitioning: 
Xi-1 = Xi9 < +++ < Xin; = x;. In that connection, it will be useful for us to label 
the points of P by double indices. In the notation x;; the first index means that 
xi; € A;, and the second index is the ordinal number of the point on the closed 
interval A;. It is now natural to set Axj; := xj; — xjj-1 and Ajj := [xjj-1, xij]. 
Thus Ax; = Axjj +---+ AXin;- 

As an example of a partition that is a refinement of both the partition P’ and P” 
one can take P = P’ U P”, obtained as the union of the points of the two partitions 
P’ and P”. 

We recall finally that, as before, w(f; E) denotes the oscillation of the function 
f on the set E, that is 


w(f;E):= sup |f(x’) — f(x”). 


Xx’ xX"EE 
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In particular, w(f; A;) is the oscillation of f on the closed interval A;. This oscil- 
lation is necessarily finite if f is a bounded function. 
We now state and prove a sufficient condition for integrability. 


Proposition 2 A sufficient condition for a bounded function f to be integrable on 
a closed interval [a, b] is that for every ¢ > 0 there exists a number 5 > 0 such that 


Sof: Aj) Ax; <& 


i=1 
for any partition P of [a,b] with mesh X(P) <6. 
Proof Let P be a partition of [a, b] and P a refinement of P. Let us estimate the 


difference between the Riemann sums o(f; P, g )-—o(S; P,é). Using the notation 
introduced above, we can write 


lo(f; P,€) —o(f; P.8)| = > 3 f Gi Axij — 3 f Axi] = 


i=1 j=l 


n nj n nj 


=|) > £GpAnj — 0 >) FG) Axil = 


i=1 j=l i=1 j=1 


n nj 


=|S° (fs) — FED) Axi] < 


i=1 j=l 


n nj 


YF Gs - FE)|Arxy < 


i=1 j=1 


IA 


n nj 


<>) olf: AD Axi = Dos Aj) Axi. 


i=l j=1 


In this computation we have used the relation Ax; = pee Ax;; and the inequality 
lf ij) — f&)| < o(f; Aj), which holds because &;; € Ajj C Aj and & € Aj. 

It follows from the estimate for the difference of the Riemann sums that if the 
function satisfies the sufficient condition given in the statement of Proposition 2, 
then for any ¢ > 0 we can find 6 > 0 such that 


lo(f; P,&) —o(f: P.&)| <5 


for any partition P of [a, b] with mesh A(P) < 6, any refinement P of P, and any 
choice of the sets of distinguished points € and é. 

Now if (P’,é’) and (P”,&”) are arbitrary partitions with distinguished points 
on [a,b] whose meshes satisfy 4(P’) < 5 and 4(P”) < 4, then, by what has just 
been proved, the partition P = P’ U P”, which is a refinement of both of them, 
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must satisfy 


lo(f: P,é)—o(f; P',E)| < 


’ 


é€ 
= 
lo(fs P,)—o(f; P”,8")| <5. 


It follows that 
lo(f: P',&) —o(fs P",8")| <e, 


provided 4(P’) < 6 and X(P") < 6. Therefore, by the Cauchy criterion, the limit of 
the Riemann sums exists: 


lim ian, 


MP)> 


that is f € Ria, bd). 


Corollary 1 (f € C[a, b]) => (f € Ria, b)), that is, every continuous function on 
a closed interval is integrable on that closed interval. 


Proof If a function is continuous on a closed interval, it is bounded there, so that 
the necessary condition for integrability is satisfied in this case. But a continuous 
function on a closed interval is uniformly ponent ¢ on that interval. Therefore, for 
every € > 0 there exists 6 > 0 such that w(f; A) < ;= on any closed interval A C 
[a, b] of length less than 5. Then for any partition P with mesh A(P) <6 we have 


Dow Aj) Ax; <p ana peas 


By Proposition 2, we can now conclude that f € R[a, bd]. 


Corollary 2 If a bounded function f on a closed interval [a,b] is continuous ev- 
erywhere except at a finite set of points, then f € Ra, b]. 


Proof Let w(f;[a, b]) < C < o, and suppose f has k points of discontinuity on 
[a, b]. We shall verify that the sufficient condition for integrability of the function f 
is satisfied. 

For a given ¢ > 0 we choose the number 6; = ICE and construct the 6,- 
neighborhood of each of the k points of discontinuity of f on [a,b]. The com- 
plement of the union of these neighborhoods in [a, b] consists of a finite number of 
closed intervals, on each of which f is continuous and hence uniformly continuous. 
Since the number of these intervals is finite, given ¢ > 0 there exists 52 > 0 such 
that on each interval A whose length is less than 52 and which is entirely contained 
in one of the 0508 intervals just mentioned, on which f is continuous, we have 


w(f; A) < Xba) a We now choose 6 = min{6j, 53}. 
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Let P be an arbitrary partition of [a, b] for which A(P) < 6. We break the sum 
y-'_, o(f; Aj) Ax; corresponding to the partition P into two parts: 


lof; An Axi = y> o(f: Aj) Ax; + y> of: Aj) Ax;. 


i=l 


The sum )~’ contains the terms corresponding to intervals A; of the partition having 
no points in common with any of the 6;-neighborhoods of the points of discontinu- 
ity. For these intervals A; we have w(f; Aj) < Xa? and so 


/ € ’ € € 
; A;)Ax; < ———~ Ax; < b =? 
Yo Of ADAX: Tee Maa Oa 
The sum of the lengths of the remaining intervals of the partition P, as one can 
easily see, is at most (6 + 26; + 6)k < 4aCE -k= Gree and therefore 


" ” € & 
ys olf; AnAx <C>> Anpe Ce = 5: 


Thus we find that for A(P) < 6, 


Y\\o(fi Ai Ax <e, 


i=l 


that is, the sufficient condition for integrability holds, and so f € R[a, b]. 


Corollary 3 A monotonic function on a closed interval is integrable on that inter- 
val. 


Proof It follows from the monotonicity of f on [a, b] that w(f; [a, b]) =| f(b) - 
f(a)|. Suppose ¢ > 0 is given. We set 6 = HO STONE We assume that f(b) — 
f (a) £0, since otherwise f is constant, and there is no doubt as to its integrability. 
Let P be an arbitrary partition of [a, b] with mesh A(P) < 6. 

Then, taking account of the monotonicity of f, we have 


Yiof: ADA <5) o(f; Aid) =8 )|f Ga) -— f@i-1| = 
i=] i=l i=1 


= 5/9 (f(a) — fi-v)| =4| f@) — f@| =e. 


i=1 


Thus f satisfies the sufficient condition for integrability, and therefore f € 
Ria, b]. 


A monotonic function may have a (countably) infinite set of discontinuities on a 
closed interval. For example, the function defined by the relations 
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aoe 1-54, forl- 4 <x<1- 7, neN, 
1 forx =1 


on [0, 1] is nondecreasing and has a discontinuity at every point of the form 1 — aa 
neéeN. 


Remark We note that, although we are dealing at the moment with real-valued func- 
tions on an interval, we have made no use of the assumption that the functions are 
real-valued rather than complex-valued or even vector-valued functions of a point of 
the closed interval [a, b], either in the definition of the integral or in the propositions 
proved above, except Corollary 3. 

On the other hand, the concept of upper and lower Riemann sums, to which we 
now turn, is specific to real-valued functions. 


Definition 6 Let f : [a,b] — R be a real-valued function that is defined and 
bounded on the closed interval [a,b], let P be a partition of [a,b], and let A; 
(i = 1,...,m) be the intervals of the partition P. Let m; = infye,, f(x) and 


Mj = Sup, f@) @=1,...,n). 
The sums 


s(f; P) =) mj Ax; 


i=1 


and 


n 
Sf; P):= > M; Ax; 
i=l 

are called respectively the lower and upper Riemann sums of the function f on the 
interval [a, b] corresponding to the partition P of that interval.'! The sums s(f; P) 
and S(f; P) are also called the lower and upper Darboux sums corresponding to 
the partition P of [a, b]. 

If (P, &) is an arbitrary partition with distinguished points on [a, b], then obvi- 
ously 


s(f;P)<o(f; P,é) < S(f; P). (6.7) 
Lemma 1 


s(f; P) =e: P,é), 


S(f; race P,&). 


The term “Riemann sum” here is not quite accurate, since m; and M; are not always values of the 
function f at some point &; € A;. 
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Proof Let us verify, for example, that the upper Darboux sum corresponding to 
a partition P of the closed interval [a, b] is the least upper bound of the Riemann 
sums corresponding to the partitions with distinguished points (P, €), the supremum 
being taken over all sets € = (&,..., &,) of distinguished points. 

In view of (6.7), it suffices to prove that for any ¢ > 0 there is a set € of distin- 
guished points such that 


Sf; P)<o(f; P,é +e. (6.8) 


By definition of the numbers M;, for each i € {1,...,} there is a point é, E€ Aj 


at which M; < f(&;) + -&. Let & = (&|,...,&,). Then 


> Mi Ax < (Fé ~ 7) 40 =f Gprx +e, 
i=l i=l i=l 


which completes the proof of the second assertion of the lemma. The first assertion 
is verified similarly. 


From this lemma and inequality (6.7), taking account of the definition of the 
Riemann integral, we deduce the following proposition. 


Proposition 3 A bounded real-valued function f : [a,b] — R is Riemann-integra- 
ble on [a, b] if and only if the following limits exist and are equal to each other: 


I= lim s(f:P), T= lim S(f;P). (6.9) 
(P)>0 M(P)>0 


When this happens, the common value I = I = 1 is the integral 


b 
/ f(x) dx. 


Proof Indeed, if the limits (6.9) exist and are equal, we conclude by the properties 
of limits and by (6.7) that the Riemann sums have a limit and that 


f= ae gr ie) 
On the other hand, if f € R[a, b], that is, the limit 


li >P,g8)=1 
go &) 


exists, we conclude from (6.7) and (6.8) that the limit lim, (p)+0 S(f; P) = T exists 
and / =I. 
Similarly one can verify that lim,;p).9s(f; P) =l=T. 


As a corollary of Proposition 3, we obtain the following sharpening of Proposi- 
tion 2. 
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Proposition 2’ A necessary and sufficient condition for a function f : [a,b] > R 
defined on a closed interval [a, b] to be Riemann integrable on [a, b] is the following 
relation: 


not Aj) Ax; = 0. (6.10) 


aes 


Proof Taking account of Proposition 2, we have only to verify that condition (6.10) 
is necessary for f to be integrable. 
We remark that w(f; A;) = M; — m;, and therefore 


Y\w(f; Ai Ax; = > (Mj — mi) Axi = S(f; P) — (fi P), 


i=l i=l 


and (6.10) now follows from Proposition 3 if f € R[a, b]. 


c. The Vector Space R[a, b] 


Many operations can be performed on integrable functions without going outside 
the class of integrable functions R[a, b]. 


Proposition 4 /f f, g © Ria, b], then 


a) (f +g) €Rla, b); 

b) (af) € RIia, b], where a is a numerical coefficient; 
c) | f| € Ria, 5]; 

d) fltc,a) € Rie, d] if[e, d] C [a, 5]; 

e) (f -g) € Ria, db]. 


We are considering only real-valued functions at the moment, but it is worthwhile 
to note that properties a), b), c), and d) turn out to be valid for complex-valued and 
vector-valued functions. For vector-valued functions, in general, the product f - g is 
not defined, so that property e) is not considered for them. However, this property 
continues to hold for complex-valued functions. 

We now turn to the proof of Proposition 4. 


Proof a) This assertion is obvious since 


n n n 
V+ au => GA +  gGdAn. 
i=l i=l i=l 
b) This assertion is obvious, since 


n 


Vif ENAr =a >) fEi)Ax- 


i=l i=l 
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c) Since w(|f|; E) < w(f; E), we can write 


Yio fl: ADAx < Do o(f; ADAx, 


i=l i=l 


and conclude by Proposition 2 that (f € R[a, b]) > (| f| € Ria, b)). 

d) We want to verify that the restriction f|[c,q| to [c,d] of a function f that is 
integrable on the closed interval [a, b] is integrable on [c, d] if [c,d] C [a, b]. Let 
a be a partition of [c, d]. By adding points to zr, we extend it to a partition P of the 
closed interval [a, b] so as to have A(P) < A(z). It is clear that one can always do 
this. 

We can then write 


> OS leas AdAx: SY O(fs ADNAN, 


where >, is the sum over all the intervals of the partition 2 and )* p the sum over 
all the intervals of P. 

By construction, as A(r) — 0 we have A(P) > 0 also, and so by Proposition 2’ 
we conclude from this inequality that (f € R[a,b]) > (f € Ric, d]) if [c,d] Cc 
[a, b]. 

e) We first verify that if f € R[a, b], then fF € R[a, b]. 

If f € Ra, b], then f is bounded on [a, b]. Let | f(x)| < C < oo on [a, b]. Then 


| f7 1) — f?x2)| = |(F Or) + fr2)) - (Ff 1) — f2))| < 2C| fx) — f 2) 


’ 


and therefore w(f7; E) < 2Cw(f; E) if E C [a, b]. Hence 
n n 
Y- of? Aj) Ax; <2C Vw fs ADA, 
i=1 i=l 
from which we conclude by Proposition 2’ that 


(f € Ria, b]) > (f* € Ria, b)). 


We now turn to the general case. We write the identity 


1 
(f -g)@)= 1 + g)°(x) —(f — 8)°(x)]. 


From this identity and the result just proved, together with a) and b), which have 
already been proved, we conclude that 


(f € Ria, b]) A (g € Ria, b]) > (f -g € Ria, b)). 


You already know what a vector space is from your study of algebra. The real- 
valued functions defined on a set can be added and multiplied by a real number, 
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both operations being performed pointwise, and the result is another real-valued 
function on the same set. If functions are regarded as vectors, one can verify that 
all the axioms of a vector space over the field of real numbers hold, and the set of 
real-valued functions is a vector space with respect to the operations of pointwise 
addition and multiplication by real numbers. 

In parts a) and b) of Proposition 4 it was asserted that addition of integrable 
functions and multiplication of an integrable function by a number do not lead out- 
side the class R[a, b] of integrable functions. Thus 7e[a, b] is itself a vector space — 
a subspace of the vector space of real-valued functions defined on the closed interval 
[a, b]. 


d. Lebesgue’s Criterion for Riemann Integrability of a Function 


In conclusion we present, without proof for the time being, a theorem of Lebesgue 
giving an intrinsic description of a Riemann-integrable function. 
To do this, we introduce the following concept, which is useful in its own right. 


Definition 7 A set E C R has measure zero or is of measure zero (in the sense of 
Lebesgue) if for every number ¢ > 0 there exists a covering of the set E by an at 
most countable system {/;} of intervals, the sum of whose lengths a |T,| is at 
most €. 

Since the series }“?~ ; |Jx| converges absolutely, the order of summation of the 
lengths of the intervals of the covering does not affect the sum (see Proposition 4 of 
Sect. 5.5.2), so that this definition is unambiguous. 


Lemma 2 a) A single point and a finite number of points are sets of measure Zero. 
b) The union of a finite or countable number of sets of measure zero is a set of 
measure zero. 
c) A subset of a set of measure zero is itself of measure zero. 
d) A closed interval [a, b] with a < b is not a set of measure zero. 


Proof a) A point can be covered by one interval of length less than any preassigned 
number ¢ > 0; therefore a point is a set of measure zero. The rest of a) then follows 
from b). 

b) Let E = ,, E” be an at most countable union of sets E” of measure zero. 
Given ¢ > 0, for each E” we construct a covering {J;’} of E” such that )>, |J/'| < 
aa 

Since the union of an at most countable collection of at most countably many 
sets is itself at most countable, the intervals 77’, k,n € N, form an at most countable 
covering of the set E, and 


aH <s+55 aT ae 
n,k 
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The order of summation >°,, , |//'| on the indices n and k is of no importance, 
since the series converges to the same sum for any order of summation if it converges 
in even one ordering. Such is the case here, since any partial sums of the series are 
bounded above by e. 

Thus E is a set of measure zero in the sense of Lebesgue. 

c) This statement obviously follows immediately from the definition of a set of 
measure zero and the definition of a covering. 

d) Since every covering of a closed interval by open intervals contains a finite 
covering, the sum of the lengths of which obviously does not exceed the sum of the 
lengths of the intervals in the original covering, it suffices to verify that the sum of 
the lengths of open intervals forming a finite covering of a closed interval [a, b] is 
not less than the length b — a of that closed interval. 

We shall carry out an induction on the number of intervals in the covering. 

For n = 1, that is, when the closed interval [a, b] is contained in one open interval 
(a, B), itis obvious thata <a<b< Band B-—a>b-—a. 

Suppose the statement is proved up to index k € N inclusive. Consider a cover- 
ing consisting of k + 1 open intervals. We take an interval (a1, @2) containing the 
point a. If a2 > b, then a2 — a; > b — a, and the result is proved. If a < a2 <b, 
the closed interval [a2, b] is covered by a system of at most k intervals, the sum of 
whose lengths, by the induction assumption, must be at least b — a2. But 


b—az=(b— az) +a2 —a < (b—a2) + (@2 — 1), 


and so the sum of the lengths of all the intervals of the original covering of the 
closed interval [a, b] was greater than its length b — a. 


It is interesting to note that by a) and b) of Lemma 2 the set Q of rational points 
on the real line R is a set of measure zero, which seems rather surprising at first 
sight, upon comparison with part d) of the same lemma. 


Definition 8 If a property holds at all points of a set X except possibly the points 
of a set of measure zero, we say that this property holds almost everywhere on X or 
at almost every point of X. 

We now state Lebesgue’s criterion for integrability. 


Theorem A function defined on a closed interval is Riemann integrable on that 
interval if and only if it is bounded and continuous at almost every point. 


Thus, 
(f € Ria, b]) > (f is bounded on [a, b]) A 
A ( f is continuous almost everywhere on [a, b)). 


It is obvious that one can easily derive Corollaries 1, 2, and 3 and Proposition 4 
from the Lebesgue criterion and the properties of sets of measure zero proved in 
Lemma 2. 
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We shall not prove the Lebesgue criterion here, since we do not need it to work 
with the rather regular functions we shall be dealing with for the present. However, 
the essential ideas involved in the Lebesgue criterion can be explained immediately. 

Proposition 2’ contained a criterion for integrability expressed by relation (6.10). 
The sum = ;-1 ©(f; A;) Ax; can be small on the one hand because of the fac- 
tors w(f; A;), which are small in small neighborhoods of points of continuity 
of the function. But if some of the closed intervals A; contain points of discon- 
tinuity of the function,then w(f; A;) does not tend to zero for these points, no 
matter how fine we make the partition P of the closed interval [a, b]. However, 
w(f; Ai) < w(f; [a, b]) < 06 since f is bounded on [a, b]. Hence the sum of the 
terms containing points of discontinuity will also be small if the sum of the lengths 
of the intervals of the partition that cover the set of points of discontinuity is small; 
more precisely, if the increase in the oscillation of the function on some intervals 
of the partition is compensated for by the smallness of the total lengths of these 
intervals. 

A precise realization and formulation of these observations amounts to the 
Lebesgue criterion. 

We now give two classical examples to clarify the property of Riemann integra- 
bility for a function. 


Example I The Dirichlet function 


1 forx €Q, 


D(x) = 
0 forx €R\Q, 


on the interval [0, 1] is not integrable on that interval, since for any partition P of 
[0, 1] one can find in each interval A; of the partition both a rational point / and an 
irrational point &/’. Then 


o(f; P.é') Se Ax; =1, 


i=l 


while 
o(f; P.&") -do. Ax; =0. 


Thus the Riemann sums of the function D(x) cannot have a limit as A(P) > 0. 

From the point of view of the Lebesgue criterion the nonintegrability of the 
Dirichlet function is also obvious, since D(x) is discontinuous at every point of 
[0, 1] which, as was shown in Lemma 2, is not a set of measure zero. 


Example 2 Consider the Riemann function 
Rix) ‘, if x € Q and x = is in lowest terms, n € N, 
FS a0) js 
0, ifxeR\Q. 
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We have already studied this function in Sect. 4.1.2, and we know that 7(x) is 
continuous at all irrational points and discontinuous at all rational points except 0. 
Thus the set of points of discontinuity of R(x) is countable and hence has mea- 
sure zero. By the Lebesgue criterion, 7¢(x) is Riemann integrable on any interval 
[a,b] C R, despite there being a discontinuity of this function in every interval of 
every partition of the interval of integration. 


Example 3 Now let us consider a less classical problem and example. 

Let f : [a,b] — R be a function that is integrable on [a, b], assuming values in 
the interval [c,d] on which a continuous function g : [c,d] — R is defined. Then 
the composition g o f : [a,b] — R is obviously defined and continuous at all the 
points of [a,b] where f is continuous. By the Lebesgue criterion, it follows that 
(go f) € Ria, bd]. 

We shall now show that the composition of two arbitrary integrable functions is 
not always integrable. 

Consider the function g(x) = |sgn|(x). This function equals | for x 4 0 and 0 
for x = 0. By inspection, we can verify that if we take, say the Riemann function 
f on the closed interval [1, 2], then the composition (g o f)(x) on that interval is 
precisely the Dirichlet function D(x). Thus the presence of even one discontinuity 
of the function g(x) has led to nonintegrability of the composition go f. 


6.1.4 Problems and Exercises 


1. The theorem of Darboux. 


a) Let s(f; P) and S(f; P) be the lower and upper Darboux sums of a real- 
valued function f defined and bounded on the closed interval [a, b] and correspond- 
ing to a partition P of that interval. Show that 


sf; Pi) = SF; Pa) 


for any two partitions P; and P of [a, b]. 

b) Suppose the partition P is a refinement of the partition P of the interval 
[a, b], and let Aj,,..., Aj, be the intervals of the partition P that contain points of 
the partition P that do not belong to P. Show that the following estimates hold: 


0 <S(f; P)— S(f; P) <o(f; [a, b]) (Axi +--+ Axi), 
O<s(f; P)—s(f; P) <o(f5 [a, b}) - (Axi, +--+ Ary). 
c) The quantities J = supp s(f; P) and I =infp S(f; P) are called respectively 


the lower Darboux integral and the upper Darboux integral of f on the closed 
interval [a, b]. Show that J < 7. 
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d) Prove the theorem of Darboux: 


I= lim s(f:P), T= lim S(f;P). 
MP)=>0 MP)>0 


e) Show that (f € R[a, b]) & Z=J). 
f) Show that f € R[a, b] if and only if for every ¢ > 0 there exists a partition P 
of [a, b] such that S(f; P) —s(f; P) <e. 


2. The Cantor set of Lebesgue measure Zero. 


a) The Cantor set described in Problem 7 of Sect. 2.4 is uncountable. Verify that 
it nevertheless is a set of measure 0 in the sense of Lebesgue. Show how to modify 
the construction of the Cantor set in order to obtain an analogous set “full of holes” 
that is not a set of measure zero. (Such a set is also called a Cantor set.) 

b) Show that the function on [0, 1] defined to be zero outside a Cantor set and 
1 on the Cantor set is Riemann integrable if and only if the Cantor set has measure 
zero. 

c) Construct a nondecreasing continuous and nonconstant function on [0, 1] that 
has a derivative equal to zero everywhere except at the points of a Cantor set of 
measure zero. 


3. The Lebesgue criterion. 


a) Verify directly (without using the Lebesgue criterion) that the Riemann func- 
tion of Example 2 is integrable. 

b) Show that a bounded function f belongs to R[a, b] if and only if for any two 
numbers ¢ > 0 and 6 > 0 there is a partition P of [a,b] such that the sum of the 
lengths of the intervals of the partition on which the oscillation of the function is 
larger than ¢ is at most 6. 

c) Show that f € R[a, b] if and only if f is bounded on [a, b] and for any ¢ > 0 
and 6 > 0 the set of points in [a, b] at which f has oscillation larger than ¢ can be 
covered by a finite set of open intervals the sum of whose lengths is less than 6 (the 
du Bois-Reymond criterion).” 

d) Using the preceding problem, prove the Lebesgue criterion for Riemann in- 
tegrability of a function. 


4. Show that if f,g € R[a,b] and f and g are real-valued, then max{f, g} € 
R{a, b] and min{ f, g} € R[a, bd]. 
5. Show that 


a) if fig € Ria,b] and f(x) = g(x) almost everywhere on [a,b], then 


b b 
fi f@)dx= f a@) de; 
b) if f € Ria, b] and f(x) = g(x) almost everywhere on [a, b], then g can fail 
to be Riemann-integrable on [a, b], even if g is defined and bounded on [a, b]. 


2P. du Bois-Reymond (1831-1889) — German mathematician. 
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6. Integration of vector-valued functions. 


a) Let r(t) be the radius-vector of a point moving in space, ro = r(0) the initial 
position of the point, and v(t) the velocity vector as a function of time. Show how 
to recover r(t) from ro and v(t). 

b) Does the integration of vector-valued functions reduce to integrating real- 
valued functions? 

c) Is the criterion for integrability stated in Proposition 2’ valid for vector-valued 
functions? 

d) Is Lebesgue’s criterion for integrability valid for vector-valued functions? 

e) Which concepts and facts from this section extend to functions with complex 
values? 


6.2 Linearity, Additivity and Monotonicity of the Integral 


6.2.1 The Integral as a Linear Function on the Space R[a, b] 


Theorem 1 /f f and g are integrable functions on the closed interval [a, b], a linear 
combination of them af + Bg is also integrable on [a, b], and 


b b b 
[ r+ bear =a f fad +B | g(x) dx. (6.11) 


Proof Consider a Riemann sum for the integral on the left-hand side of (6.11), and 
transform it as follows: 


Y (af + Bg) Gi) Axi =a D* fE)AM +BY gGi)Axi. (6.12) 


i=l i=l i=l 


Since the right-hand side of this last equality tends to the linear combination of 
integrals that makes up the right-hand side of (6.11) if the mesh 1.(P) of the partition 
tends to 0, the left-hand side of (6.12) must also have a limit as A(P) — 0, and that 
limit must be the same as the limit on the right. Thus (wf + 6g) € R[a, b] and 
Eq. (6.11) holds. 


If we regard R[a, b] as a vector space over the field of real numbers and the inte- 
gral 4 Ff (x) dx as a real-valued function defined on vectors of R[a, b], Theorem | 
asserts that the integral is a linear function on the vector space R[a, b]. 

To avoid any possible confusion, functions defined on functions are usually 
called functionals. Thus we have proved that the integral is a linear functional on 
the vector space of integrable functions. 
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6.2.2 The Integral as an Additive Function of the Interval 
of Integration 


The value of the integral Vi f(x) dx = I(f; [a, b]) depends on both the integrand 
and the closed interval over which the integral is taken. For example, if f € R[a, b], 
then, as we know, f|[a,p] € Fla, B] if [a, 8] C [a, b], that is, the integral { f(x) dx 
is defined, and we can study its dependence on the closed interval [a, 8] of integra- 
tion. 


Lemma 1 [fa <b<cand f € R[a, cl], then f tan) € Ria, bl, fle € RIB, cl, 
and the following equality’ holds; 


(3 b c 
/ foyax= [ poyar+ [ f(x) dx. (6.13) 
a a b 


Proof We first note that the integrability of the restrictions of f to the closed inter- 
vals [a, b] and [b, c] is guaranteed by Proposition 4 of Sect. 6.1. 

Next, since f € R[a,c], in computing the integral i f(x) dx as the limit of 
Riemann sums we may choose any convenient partitions of [a,c]. We shall now 
consider only partitions P of [a,c] that contain the point b. Obviously any such 
partition with distinguished points (P, €) generates partitions (P’, €’) and (P”, €”) 
of [a, b] and [b, c] respectively, and P = P’ U P” and é =&' UE". 

But then the following equality holds between the corresponding Riemann sums: 


o(f; P,€)=o0(f; P’,&') +o(f; P”.&"). 


Since A(P’) < A(P) and A(P”) < A(P), for A(P) sufficiently small, each of these 
Riemann sums is close to the corresponding integral in (6.13), which consequently 
must hold. 


To widen the application of this result slightly, we temporarily revert once again 
to the definition of the integral. 
We defined the integral as the limit of Riemann sums 


eG: P,6)=) fGAn, (6.14) 


i=l 


corresponding to partitions with distinguished points (P, &) of the closed interval of 
integration [a, b]. A partition P consisted of a finite monotonic sequence of points 
X0,X1,---,Xn, the point x9 being the lower limit of integration a and x, the upper 
limit of integration b. This construction was carried out assuming that a < b. If we 


3We recall that f|E denotes the restriction of the function f to a set E contained in the domain 
of definition of f. Formally we should have written the restriction of f to the intervals [a, b] and 
[b, c], rather than f, on the right-hand side of Eq. (6.13). 


6.2 Linearity, Additivity and Monotonicity of the Integral 351 


now take two arbitrary points a and b without requiring a < b and, regarding a as 
the lower limit of integration and b as the upper, carry out this construction, we shall 
again obtain a sum of the form (6.14), in which now Axj > O(@ =1,...,n)ifa<b 
and Ax; <O(i=1,...,n)ifa>b, since Ax; = x; — x;_-,. Thus for a > b the sum 
(6.14) will differ from the Riemann sum of the corresponding partition of the closed 
interval [b, a] (b < a) only in sign. 

From these considerations we adopt the following convention: if a > b, then 


b a 
| f(x) dx i= -| f(x) dx. (6.15) 
a b 


In this connection, it is also natural to set 


a f(x) dx :=0. (6.16) 


After these conventions, taking account of Lemma |, we arrive at the following 
important property of the integral. 


Theorem 2 Leta,b,c € Rand let f be a function integrable over the largest closed 
interval having two of these points as endpoints. Then the restriction of f to each of 
the other closed intervals is also integrable over those intervals and the following 
equality holds: 


b c a 
i f(x) dx +f f(x)dx + i; f(x)dx =0. (6.17) 
a b ra 


Proof By the symmetry of Eq. (6.17) in a, b, and c, we may assume without loss of 
generality that a = min{a, b, c}. 
If max{a, b,c} =c anda < b <c, then by Lemma | 


b c c 
/ flayde + f fex)de ~ | f(x)dx =0, 
a b a 


which, when we take account of the convention (6.15) yields (6.17). 
If max{a, b,c} =b anda <c <b, then by Lemma | 


c b b 
[ sears | fiaydx = | f(x) dx =0, 


which, when we take account of (6.15), again yields (6.17). 
Finally, if two of the points a, b, and c are equal, then (6.17) follows from the 
conventions (6.15) and (6.16). 


Definition 1 Suppose that to each ordered pair (a, §) of points a, 6B € [a, b] anum- 
ber J (a, B) is assigned so that 


I(a,y) =I (a, B) + 1(B, y) 


for any triple of points a, 6, y € [a, b]. 
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Then the function J(q@, 6) is called an additive (oriented) interval function de- 
fined on intervals contained in [a, b]. 

If f € R[A, B], anda, b,c e€[A, B], then, setting J (a, b) = ie Ff (x) dx, we con- 
clude from (6.17) that 


c b c 
/ feaax= [ fladrt f f(x) dx, (6.18) 
a a b 


that is, the integral is an additive interval function on the interval of integration. 
The orientation of the interval in this case amounts to the fact that we order the 
pair of endpoints of the interval by indicating which is to be first (the lower limit of 
integration) and which is to be second (the upper limit of integration). 


6.2.3 Estimation of the Integral, Monotonicity of the Integral, 
and the Mean-Value Theorem 


a. A General Estimate of the Integral 


We begin with a general estimate of the integral, which, as will become clear later, 
holds for integrals of functions that are not necessarily real-valued. 


Theorem 3 [fa <band f € R{a, bl, then | f| € Ria, b] and the following inequal- 


ity holds: 
b b 
| f (x) dx <| | fl(x) dx. (6.19) 
If | f \(x) < C on [a, b] then 
b 
[ ita sco-a (6.20) 


Proof For a = b the assertion is trivial, and so we shall assume that a < b. 
To prove the theorem it now suffices to recall that | f| ¢ [a,b] (see Propo- 
sition 4 of Sect. 6.1), and write the following estimate for the Riemann sum 


o(f; P,&): 


<)-|f&)||Axil = Do | FE) Axi < CY Axi = C(b - a). 


i=1 i=l i=l 


> fGNAx: 


i=l 


Passing to the limit as A(P) — 0, we obtain 


b 
[ fou 


b 
<[ \flcyax = co ~a). 
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b. Monotonicity of the Integral and the First Mean- Value Theorem 
The results that follow are specific to integrals of real-valued functions. 


Theorem 4 Ifa <b, fi, f2 € Ria, bl, and f\(x) < fo(x) at each point x € [a, b], 
then 


b b 
[ fmas [ fa(x) dx. (6.21) 


Proof For a = b the assertion is trivial. If a < b, it suffices to write the following 
inequality for the Riemann sums: 


> fiE) Axi < > f2 (Ei) Axi, 


i=1 i=1 


which is valid since Ax; > 0 @ =1,...,7), and then pass to the limit as A(P) > 0. 
Theorem 4 can be interpreted as asserting that the integral is monotonic as a 
function of the integrand. 
Theorem 4 has a number of useful corollaries. 


Corollary 1 Ifa <b, f € R[a, b]; andm < f(x) < M at each x € [a, b], then 

b 

m-b-a)s | f(x)dx < M-(b—-a), (6.22) 

a 

and, in particular, if 0 < f(x) on [a, b], then 
b 
o< | f(x) dx. 
a 


Proof Relation (6.22) is obtained by integrating each term in the inequality m < 
f(x) < M and using Theorem 4. 


Corollary 2 If f ¢ Ria, b], m = infyetap) f(x), and M = SUP x ¢[a,b] f(x), then 
there exists anumber tt € [m, M] such that 


b 
/ f(x) dx = w-(b—a). (6.23) 


Proof If a = b, the assertion is trivial. If a # b, we set w= 54 [” f(x) dx. It then 
follows from (6.22) that m < uw < M if a < b. But both sides of (6.23) reverse sign 
if a and b are interchanged, and therefore (6.23) is also valid for b <a. 


Corollary 3 If f € C[a, b], there is a point — € [a, b] such that 


b 
/ f(x) dx = f(E)(b — a). (6.24) 
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Proof By the intermediate-value theorem for a continuous function, there is a 
point € on [a, b] at which f(&€) = w if 


m= min f(x)<u< max fx)=M. 
xe[a,b] xe[a,b] 


Therefore (6.24) follows from (6.23). 


The equality (6.24) is often called the first mean-value theorem for the integral. 
We, however, reserve that name for the following somewhat more general proposi- 
tion. 


Theorem 5 (First mean-value theorem for the integral) Let f, g € R[a,b], m = 
infyefa,p] f(x), and M = supy eta. f(). If g is nonnegative (or nonpositive) on 
[a, b], then 


b b 
/ (f -g)(x)dx =m / g(x) dx, (6.25) 


where u €[m, M). 
If, in addition, it is known that f € C[a, b], then there exists a point & € [a, b] 
such that 


b b 
/ (f -a)(x) dx = f(E) / reves (6.26) 


Proof Since interchanging the limits of integration leads to a simultaneous sign 
reversal on both sides of Eq. (6.25), it suffices to verify this equality for the case 
a <b. Reversing the sign of g(x) also reverses the signs of both sides of (6.25), so 
that we may assume without loss of generality that g(x) > 0 on [a, b]. 

Since m = infyefa,p] f(x) and M = SUP <[a,b] Ff (x), we have, for g(x) => 0, 


mg(x) < f(x)g(x) < Mg(x). 
Since m- g € Ria, b], f -g ¢ Ria, b], and M - g € RIa, b], applying Theorem 4 
and Theorem |, we obtain 


b b b 
mf ecoar = [ foga)dx <M | g(x) dx. (6.27) 


If Via g(x) dx = 0, it is obvious from these inequalities that (6.25) holds. 
If [? g(x) dx £0, then, setting 


b -1 b 
u=(f s(x) a) | (f - g(x) dx, 


we find by (6.27) that 
m<u<M, 
but this is equivalent to (6.25). 
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The equality (6.26) now follows from (6.25) and the intermediate-value theorem 
for a function f € C[a, b], if we take account of the fact that when f € C[a, b], we 
have 


m= min x) and M= max x). 
cout ars ) xe[a,b] F@) 


We remark that (6.23) results from (6.25) if g(x) = 1 on [a, b]. 


c. The Second Mean-Value Theorem for the Integral 


The so-called second mean-value theorem’ is significantly more special and delicate 
in the context of the Riemann integral. 

So as not to complicate the proof of this theorem, we shall carry out a useful 
preparatory discussion that is of independent interest. 

Abel’s transformation. This is the name given to the following transformation of 
the sum )°7_, ajbj. Let Ay = Sy aj; we also set Ang = 0. Then 


m n n n 
> abi = (Ai — Ai) bi = DO Ai: — > Ai-18; = 
i=l 


i=1 t=! i=] 


n n—1 n—-1 
= \> Aibi — )~ Aibig1 = Anbn — Aobi + > Ai (bi — bi 41). 
i=1 i=0 i=1 
Thus 
n n—l 
Y> ajbj = (Anbn — Aobi) + > Aj (bj — bi41), (6.28) 
i= i=1 
or, since Ag = 0, 

n n—l 
Yo aibi = Anbn + D> Ai(bi — bi41). (6.29) 


i=l i=l 
Abel’s transformation provides an easy verification of the following lemma. 
Lemma 2 /f the numbers Ax = yy aj (k =1,...,n) satisfy the inequalities m < 


Ag < M and the numbers bj (i = 1,...,n) are nonnegative and bj > bj+4, fori = 
1,...,n —1, then 


n 
mb, < So aibi < Mb. (6.30) 


i=1 


4Under an additional hypothesis on the function, one that is often completely acceptable, Theo- 
rem 6 in this section could easily be obtained from the first mean-value theorem. On this point, see 
Problem 3 at the end of Sect. 6.3. 
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Proof Using the fact that b, > 0 and bj — bj4; > 0 fori =1,...,n — 1, we obtain 
from (6.29), 


n n-1 
> aibi SMbn +) M0; — bi41) = Mby + M(b1 — bn) = M1. 
i=1 i=l 


The left-hand inequality of (6.30) is verified similarly. 


Lemma 3 /f f € R[a, b], then for any x € [a, b] the function 
ra= | f(t) dt (6.31) 


is defined and F(x) € C[{a, b]. 


Proof The existence of the integral in (6.31) for any x € [a, b] is already known 
from Proposition 4 of Sect. 6.1; therefore it remains only for us to verify that the 
function F(x) is continuous. Since f € R[a, b], we have | f| < C < oo on [a, dD]. 
Let x € [a, b] and x +h ¢€ [a, b]. Then, by the additivity of the integral and inequal- 


ities (6.19) and (6.20) we obtain 
x+h x 
/ soar f f(@) dt 


xth xth 
y f(t)dt / | f (t)| de 


Here we have used inequality (6.20) taking account of the fact that for h < 0 we 


have 
x+h x x 
i | f (t)| de -|-f | f (t)| de =| | f (t)| de. 
x xth xth 


Thus we have shown that if x and x + h both belong to [a, b], then 


|F@ +h) — F(x)| = 


= = <C|hI. 


|F(x +h) — F(x)| <C|h| (6.32) 


from which it obviously follows that the function F' is continuous at each point of 
[a, b]. 


We now prove a lemma that is a version of the second mean-value theorem. 


Lemma 4 /f f, g © R[a, b] and g is anonnegative nonincreasing function on [a, b] 
then there exists a point & € [a, b] such that 


b 3 
(f - g)(x) dx = g(a) / fla)dx. (6.33) 


6.2 Linearity, Additivity and Monotonicity of the Integral 357 


Before turning to the proof, we note that, in contrast to relation (6.26) of the first 
mean-value theorem, it is the function f(x) that remains under the integral sign in 
(6.33), not the monotonic function g. 


Proof To prove (6.33), as in the cases considered above, we attempt to estimate the 
corresponding Riemann sum. 
Let P be a partition of [a, b]. We first write the identity 


b n Xj 
[r-wa=> | (fede 
@ i=] °%i-1 


=e fo fear > | “ [e(@) — ei] fa) dx 
i=l Xi-1 i=l ° 7% 


i-1 


and then show that the last sum tends to zero as A(P) > 0. 
Since f € R[a, b], it follows that | f(x)| < C < o on [a, b]. Then, using the 
properties of the integral already proved, we obtain 


>| | [g(x) — gQu-1)] f @) de 
i=] °%i-1 


=f |e -s0)|| fooler < 
i=] °%i-1 


~ |g) — g(x) dx < 


<c¥ f 
i=1°* 


i-1 


n 
<C) lols; 4)) Ax; > 0 
i= 


as A(P) — 0, because g € Ra, b] (see Proposition 2 of Sect. 6.1). Therefore 


b n X; 
-g)(x)dx = lim Xj- x) dx. 6.34 
/ (f -g)@) Ree ae of f(x) (6.34) 
We now estimate the sum on the right-hand side of (6.34). Setting 


F(x) = / "fodder, 


by Lemma 3 we obtain a continuous function on [a, b]. 
Let 


m= min F(x) and M= max F(x). 
xe[a,b] xe[a,b] 
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Since ae f(x) dx = F(x;) — F(x;-1), it follows that 


n 


Dewi f | f(x)dx = > (FQ) — FGi-1))g@i-1- (6.35) 
i=l Xi-1 


i=l 


Taking account of the fact that g is nonnegative and nonincreasing on [a, b], and 
setting 


ay = F(x) — F(xi-1), bi = g(xi-1), 


we find by Lemma 2 that 
n 
mg(a) < )\(F (xi) — F@i-1)) g@i-1) < Mga), (6.36) 
i=1 
since 
k 


Ag =) (ai = F (xx) — F(x) = Fae) — F(@) = F(x). 
i=1 
Having now shown that the sums (6.35) satisfy the inequalities (6.36), and re- 
calling relation (6.34), we have 


b 
mga) = [ (f - g)(x) dx < Mg(a). (6.37) 


If g(a) = 0, then, as inequalities (6.37) show, the relation to be proved (6.33) is 
obviously true. 
If g(a) > 0, we set 


1 b 
b= — | (f - g)(x) dx. 
g(a) a 


It follows from (6.37) that m < uw < M, and from the continuity of F(x) = 
iP f(t) dt on [a, b] that there exists a point € € [a, b] at which F(€) = yw. But that 
is precisely what formula (6.33) says. 


Theorem 6 (Second mean-value theorem for the integral) If f, g ¢ R[a, b| and g 
is a monotonic function on [a, b], then there exists a point & € [a, b| such that 


b é b 
/ if odes seh / Oe | Fx)dx. (6.38) 


The equality (6.38) (like (6.33), as it happens) is often called Bonnet’ s formula? 


5P.O. Bonnet (1819-1892) — French mathematician and astronomer. His most important mathe- 
matical works are in differential geometry. 
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Proof Let g be a nondecreasing function on [a, b]. Then G(x) = g(b) — g(x) is 
nonnegative, nondecreasing, and integrable on [a, b]. Applying formula (6.33), we 
find 


b & 
/ (f -G)(x)dx = ca | St (x) dx. (6.39) 


But 


b b b 
[ r-@er=ew | flayax = | Pea, 


g g g 
Ga | flx)dr = gb) | fla)dx— gta) | f(x) dx. 


Taking account of these relations and the additivity of the integral, we obtain the 
equality (6.38), which was to be proved, from (6.39). 

If g is a nonincreasing function, setting G(x) = g(x) — g(b), we find that G(x) 
is a nonnegative, nonincreasing, integrable function on [a, b]. We then obtain (6.39) 
again, and then formula (6.38). 


6.2.4 Problems and Exercises 


1. Show that if f € R[a, b] and f(x) => 0 on [a, bd], then the following statements 
are true. 


a) If the function f(x) assumes a positive value f (xo) > 0 at a point of conti- 
nuity xo € [a, b], then the strict inequality 


b 
/ f(x) dx >0 


holds. 
b) The condition f i Ff (x) dx =0 implies that f(x) = 0 at almost all points of 
[a, D]. 


2. Show that if f € R[a, b], m = infja,p, f(x), and M = SUP}a,p, f (x), then 


a) iF Ff (x) dx = (b — a), where jz € [m, M] (see Problem 5a of Sect. 6.1); 
b) if f is continuous on [a, b], there exists a point € € Ja, b[ such that 


b 
/ f(x) dx = f(E)(b — a). 


3. Show that if f € Cla, b], f(x) = 0 on [a, b], and M = maxjq,p) f(x), then 


b 1/n 
sim ([ "(aya =M. 
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4. a) Show that if f € R[a, b], then | f|? € R[a, b] for p > 0. 


b) Starting from Hélder’s inequality for sums, obtain Hoélder’s inequality for 


integrals:° 
b I/p b 1/q 
<(f isl?) ax) (| sI"() dr) 


if f,g € Ria, b], p>1,q> land 5 +5 =1, 
c) Starting from Minkowski’s inequality for sums, obtain Minkowski’s inequal- 
ity for integrals: 


b 1/p b 1/p b 1/p 
(/ f+ ei?) dr) <(f LflP (x) ar) +(f el")dr) ; 


if f,g € R[a,b] and p > 1. Show that this inequality reverses direction if 0 < 
p<. 

d) Verify that if f is a continuous convex function on R and ¢ an arbitrary 
continuous function on R, then Jensen’s inequality 


1 f° 1 f° 
(=f omar) <2 [ remyar 


b 
/ (f - g)(x) dx 


holds for c £0. 
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6.3.1 The Integral and the Primitive 


Let f be a Riemann-integrable function on a closed interval [a, b]. On this interval 
let us consider the function 


F(x) = [ f@ dt, (6.40) 


often called an integral with variable upper limit. 
Since f € Ra, b], it follows that f|[a,xj € Ria, x] if [a,x] C [a, b]; therefore 
the function x + F(x) is unambiguously defined for x € [a, b]. 


©The algebraic Hélder inequality for p = q = 2 was first obtained in 1821 by Cauchy and bears his 
name. Holder’s inequality for integrals with p = q = 2 was first discovered in 1859 by the Russian 
mathematician B.Ya. Bunyakovskii (1804-1889). This important integral inequality (in the case 
P =4q =2) 1s called Bunyakovskii’s inequality or the Cauchy—Bunyakovskii inequality. One also 
some-times sees the less accurate name “Schwarz inequality” after the German mathematician 
H.K.A. Schwarz (1843-1921), in whose work it appeared in 1884. 
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If | f(t)| < C < +00 on [a,b] (and f, being an integrable function, is bounded 
on [a, b]), it follows from the additivity of the integral and the elementary estimate 
of it that 


|F(x +h) — F(x)| <ClAl, (6.41) 


ifx,x+hel[a,b]. 

Actually, we already discussed this while proving Lemma 3 in the preceding 
section. 

It follows in particular from (6.41) that the function F is continuous on [a, b], so 
that F € C[a, b]. 

We now investigate the function F more thoroughly. 

The following lemma is fundamental for what follows. 


Lemma 1 /f f € [a,b] and the function f is continuous at a point x € [a, b], 
then the function F defined on [a, b] by (6.40) is differentiable at the point x, and 
the following equality holds: 


F'(x) = f(x). 


Proof Let x,x +h € [a,b]. Let us estimate the difference F(x + h) — F(x). It 
follows from the continuity of f at x that f(t) = f(x) + A(t), where A(t) > 0 
as t > x, t € [a, b]. If the point x is held fixed, the function A(t) = f(t) — f(x) 
is integrable on [a, b], being the difference of the integrable function tr> f(t) and 
the constant f(x). We denote by M(h) the quantity sup,<7(,) |A(¢)|, where /(h) is 
the closed interval with endpoints x, x + h € [a,b]. By hypothesis M(h) — 0 as 
h->0. 
We now write 
x+ 


x+h % h 
Fath) F=f foar— | fnar= [ f(t)dt= 


xt+h 
= (f(x) + AQ) dt = 


xth x+h 
ay fanart | A(t) dt = f(x)h+a(hjh, 
* o 
where we have set 
x+h 
i A(t) dt =a(h)jh. 
x 
Since 

< 


< = M(h)\h|, 


xth xth x+h 
/ A(t) dt ; | A(t)| dr i M(h) dt 


it follows that |a(h)| < M(h), and so a(h) > 0.as h > 0 (in sucha way thatx +h € 
[a, b]). 
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Thus we have shown that if the function f is continuous at a point x € [a, b], then 
for displacements h from x such that x + h € [a, b] the following equality holds: 


F(x +h) — F(x) = f(x)h ta(hyh, (6.42) 


where a(h) > Oash— 0. 
But this means that the function F(x) is differentiable on [a, b] at the point x € 
[a, b] and that F’(x) = f (x). 


A very important immediate corollary of Lemma | is the following. 


Theorem 1 Every continuous function f : [a,b] > R on the closed interval [a, b] 
has a primitive, and every primitive of f on [a, b] has the form 


x 
F(x)= i f@dt+c, (6.43) 
a 
where c is a constant. 


Proof We have the implication (f € C[a, b]) > (f € R[a, b]), so that by Lemma | 
the function (6.40) is a primitive for f on [a, b]. But two primitives F(x) and F(x) 
of the same function on a closed interval can differ on that interval only by a con- 
stant; hence F(x) = F(x) +c. 


For later applications it is convenient to broaden the concept of primitive slightly 
and adopt the following definition. 


Definition 1 A continuous function x + F(x) on an interval of the real line is 
called a primitive (or generalized primitive) of the function x — f(x) defined on 
the same interval if the relation F’(x) = f(x) holds at all points of the interval, with 
only a finite number of exceptions. 

Taking this definition into account, we can assert that the following theorem 
holds. 


Theorem 1’ A function f : [a,b] > R that is defined and bounded on a closed 
interval [a, b] and has only a finite number of points of discontinuity has a (gen- 
eralized) primitive on that interval, and any primitive of f on [a,b] has the form 


(6.43). 


Proof Since f has only a finite set of points of discontinuity, f € 7e[a, b], and by 
Lemma | the function (6.40) is a generalized primitive for f on [a, b]. Here we have 
taken into account, as already pointed out, the fact that by (6.41) the function (6.40) 
is continuous on [a, b]. If F(x) is another primitive of f on [a, b], then F(x) — F(x) 
is a continuous function and constant on each of the finite number of intervals into 
which the discontinuities of f divide the closed interval [a, b]. But it then follows 
from the continuity of F(x) — F(x) on all of [a, b] that F(x) — F(x) =const on 
[a, bl. 
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6.3.2 The Newton—Leibniz Formula 


Theorem 2 /f f : [a,b] > R is a bounded function with a finite number of points 
of discontinuity, then f € Rla, b] and 


b 
f(x) dx = F(b) — F(a), (6.44) 


where F : [a,b] > R is any primitive of f on [a, b]. 

Proof We already know that a bounded function on a closed interval having only a 
finite number of discontinuities is integrable (see Corollary 2 after Proposition 2 in 
Sect. 6.1). The existence of a generalized primitive F(x) of the function f on [a, b] 


is guaranteed by Theorem 1’, by virtue of which F(x) has the form (6.43). Setting 
x =a in (6.43), we find that F(a) =c, and so 


Fue i} fat + Fa). 


In particular 


b 
i f(t) dt = F(b) — F(a), 


which, up to the notation for the variable of integration, is exactly formula (6.44), 
which was to be proved. 


Relation (6.44), which is fundamental for all of analysis, is called the Newton— 
Leibniz formula (or the fundamental theorem of calculus). 

The difference F(b) — F(a) of values of any function is often written F(x)| . .In 
this notation, the Newton—Leibniz formula assumes the form 


b 
/ f(x)dx =F x)’, 


Since both sides of the formula reverse sign when a and b are interchanged, the 
formula is valid for any relation between the magnitudes of a and J, that is, both for 
a<bandfora>b. 

In exercises of analysis the Newton—Leibniz formula is mostly used to compute 
the integral on the left-hand side, and that may lead to a some-what distorted idea 
of its use. The actual situation is that particular integrals are rarely found using a 
primitive; more often one resorts to direct computation on a computer using highly 
developed numerical methods. The Newton—Leibniz formula occupies a key po- 
sition in the theory of mathematical analysis itself, since it links integration and 


364 6 Integration 


differentiation. In analysis it has a very far-reaching extension in the form of the 
so-called generalized Stokes’ formula.’ 

An example of the use of the Newton—Leibniz formula in analysis itself is pro- 
vided by the material in the next subsection. 


6.3.3 Integration by Parts in the Definite Integral and Taylor’s 
Formula 


Proposition 1 [f the functions u(x) and v(x) are continuously differentiable on a 
closed interval with endpoints a and b, then 


b b b 
/ (u-v)apdx = wl f (v-u’)(x) dx. (6.45) 


It is customary to write this formula in abbreviated form as 


b b b 
i udv=u-v|t— f vdu 
a a 


and call it the formula for integration by parts in the definite integral. 


Proof By the rule for differentiating a product of functions, we have 
(u- v(x) = (u’-v)(@x) + (u- v’) (x). 


By hypothesis, all the functions in this last equality are continuous, and hence inte- 
grable on the interval with endpoints a and b. Using the linearity of the integral and 
the Newton—Leibniz formula, we obtain 


b 


b 
wool =f (urna + f (u- v')(x) de. 


a 


As a corollary we now obtain the Taylor formula with integral form of the re- 
mainder. 

Suppose on the closed interval with endpoints a and x the function tt f(t) has 
n continuous derivatives. Using the Newton—Leibniz formula and formula (6.45), 
we carry out the following chain of transformations, in which all differentiations 
and substitutions are carried out on the variable r: 


fa-f@= / f' (pdt =— / fe — 0) dt = 


7G.G. Stokes (1819-1903) — British physicist and mathematician. 
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=o + f"O@—-tdt= 
1 f(* / 
= f'(a(x —a)- Al f" O(a —- 1) dt = 
/ 1 " 2|* i ‘ my 2 
=f@E-a)—- sf O@-H E+5f f° OG — 1) dt = 


1 
= f'(ay(x—a) + 5f"@( - a)’ - sf f"O(@-1) dt = 


= f'(a)(x —a)+ 5a gy fae 


+ — tp (qy( —a)"" + ry_1(a; x), 
--(n—1) 


where 


1 ~ 
rn—1(a; X) = aa | f™ OG * di (6.46) 


Thus we have proved the following proposition. 


Proposition 2 [f the function t > f(t) has continuous derivatives up to order n 
inclusive on the closed interval with endpoints a and x, then Taylor’s formula holds: 


1 
FO=fAQ+TZ/O&-a+--4+ FOV @o-a)" 1 + m1 4) 


(n— a 


with remainder term rn—\(a; x) represented in the integral form (6.46). 


We note that the function (x — t)”~! does not change sign on the closed interval 
with endpoints a and x, and since t > f(t) is continuous on that interval, the 
first mean-value theorem implies that there exists a point € such that 


Se 1 ° (n) — yt-laqpe 
rn—1(a; x) = q=i) PYeoO tm —-t) dt = 


=o ff" (x—lar= 


a a fe(- wr= 9") 


We have again obtained the familiar Lagrange form of the remainder in Taylor’s 
theorem. By Problem 2d) of Sect. 6.2, we may assume that & lies in the open interval 
with endpoints a and x. 


= “F&C ay", 
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This reasoning can be repeated, taking the expression f)(€)(x — &)"~*, where 
k €[1,n], outside the integral in (6.46). The Cauchy and Lagrange forms of the 
remainder term that result correspond to the values k = | andk =n. 


6.3.4 Change of Variable in an Integral 


One of the basic formulas of integral calculus is the formula for change of variable 
in a definite integral. This formula is just as important in integration theory as the 
formula for differentiating a composite function is in differential calculus. Under 
certain conditions, the two formulas can be linked by the Newton—Leibniz formula. 


Proposition 3 [fy :[a, 8] — [a, b] is a continuously differentiable mapping of the 
closed interval a < t < B into the closed interval a < x < b such that g(a) =a and 
y(B) =), then for any continuous function f (x) on [a, b] the function f (v(t))g'(t) 
is continuous on the closed interval [a, B |, and 
b B 
/ f(x)dx = / f(g@)¢"o) de. (6.47) 
a a 

Proof Let F(x) be a primitive of f(x) on [a,b]. Then, by the theorem on differ- 
entiation of a composite function, the function F(g(t)) is a primitive of the func- 
tion f(v(t))y'(t), which is continuous, being the composition and product of con- 
tinuous functions on the closed interval [a, 8]. By the Newton—Leibniz formula 


I f(x) dx = F(b) — F(a) and ioe f(e@)g' dt = F(e(B)) — F(p(@)). But by 
hypothesis g(a~) = a and g(B) = D, so that Eq. (6.47) does indeed hold. 


It is clear from formula (6.47) how convenient it is that we have not just the sym- 
bol for the function, but the entire differential f(x) dx in the symbol for integration, 
which makes it possible to obtain the correct integrand automatically when the new 
variable x = y(t) is substituted in the integral. 

So as not to complicate matters with a cumbersome proof, in Proposition 3 we 
deliberately shrank the true range of applicability of (6.47) and obtained it by the 
Newton—Leibniz formula. We now turn to the basic theorem on change of variable, 
whose hypotheses differ somewhat from those of Proposition 3. The proof of this 
theorem will rely directly on the definition of the integral as the limit of Riemann 
sums. 


Theorem 3 Let y : [a, B] — [a,b] be a continuously differentiable strictly mono- 
tonic mapping of the closed interval a < t < B into the closed interval a < x <b 
with the correspondence p(a) = a, g(6B) = b or g(a) = b, o(B) =a at the 
endpoints. Then for any function f(x) that is integrable on [a,b] the function 
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f (e(t))¢' (t) is integrable on [a, B| and 


g(B) B 
ik f(x) dx =a f(g@)¢'( de. (6.48) 
g(a a 


Proof Since ¢ is a strictly monotonic mapping of [@, 8] onto [a, b] with endpoints 
corresponding to endpoints, every partition P; (@ = t9 <--- <t, = B) of the closed 
interval [a@, 8] generates a corresponding partition P, of [a, b] by means of the im- 
ages x; = g(t;) (i =0,..., 7); the partition P, may be denoted g(P,). Here xp =a 
if g(a) =a and x9 = b if p(a) = b. It follows from the uniform continuity of g on 
[a, B] that if A(P;) — 0, then A(P,) = A(@(P;)) also tends to zero. 

Using Lagrange’s theorem, we transform the Riemann sum o(f; P,, &) as fol- 
lows: 


Yo FG) Ax = > FENG — 1-1) = 


i=1 i=1 


=>" flee GG — 4-1) = F(e@))¢' An. 


i=1 i=1 


Here x; = g(t;), & = (7), &; lies in the closed interval with endpoints x;_1 
and x;, and the points t; and 7; lie in the interval with endpoints t;_; and t; @ = 


.,N). 
Next 


n 


Vi fe@)e'HAt = 3 f (oti) g(t) Ati + 


i=l i=l 
+ 5° f(g) (¢' G) — 9a) At. 
i=l 


Let us estimate this last sum. Since f € R[a, b], the function f is bounded on 
[a, b]. Let | f(x)| < C on [a, b]. Then 


ROOK g(t) — g(t) ) At; <C. Low i) Atj, 


i=l i=1 


where A; is the closed interval with endpoints 7;_1 and #;. 
This last sum tends to zero as A(P;) — 0, since g’ is continuous on [a, ]. 
Thus we have shown that 


> fF ENAM => (OCH) 0G) At + &, 
i=l 


i=1 
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where a — 0 as A(P;) > 0. As already pointed out, if A(P;) > 0, then A(P,) > 0 
also. But f € Ra, b], so that as A(P,) — 0 the sum on the left-hand side of this 
last equality tends to the integral Ls Ff (x) dx. Hence as 4(P;) — 0 the right-hand 
side of the equality also has the same limit. 

But the sum )~7_, f (v(t)! (ti) At; can be regarded as a completely arbitrary 
Riemann sum for the function f (g(t))g’(t) corresponding to the partition P, with 
distinguished points tT = (T),..., T,), since in view of the strict monotonicity of ¢, 
any set of points t can be obtained from some corresponding set € = (&1,..., &,) of 
distinguished points in the partition P, = g(P;). 

Thus, the limit of this sum is, by definition, the integral of the function 
f (e(t))¢'(t) over the closed interval [aw, 8], and we have simultaneously proved 
both the integrability of f(y(t))¢’(t) on [a, B] and formula (6.48). 


6.3.5 Some Examples 


Let us now consider some examples of the use of these formulas and the theorems 
on properties of the integral proved in the last two sections. 


Example 1 
1 m/2 m/2 
/ Vvl—x?2dx = V1 = sin? reost dt = f cos’ t dt = 
= —n/2 —1/2 
1 ie 1 1 1 
=x (1 400820) dt = 5(14+ 5 sin2r) =. 
2 —n/2 2 2 —7/2 2 


In computing this integral we made the change of variable x = sint and then, 
after finding a primitive for the integrand that resulted from this substitution, we 
applied the Newton—Leibniz formula. 

Of course, we could have proceeded differently. We could have found the rather 
cumbersome primitive 5x 1—x2+4 5 arcsinx for the function 1 — x? and then 
used the Newton—Leibniz formula. This example shows that in computing a defi- 
nite integral one can fortunately sometimes avoid having to find a primitive for the 


integrand. 


Example 2 Let us show that 


us IU 
a) sinmx cosnx dx = 0, b) / sin? mx dx = I, 
—T =Tt 
Tw 
c) cos* nx dx = 7 
—T 


form,neéeN. 
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us 1 us 
i sinmx cosnx dx = Al (sin(n +m)x — sin(n — m)x) dx = 


4 


n—-m 


cos(n — mx) 


1 1 
= 5 (-- cos(n + m)x + 


I 
=0, 


if n —m 40. The case when n — m = 0 can be considered separately, and in this 
case we obviously arrive at the same result. 


b) 
1 {(” 1 1 i 
i sin? mx dx = = (1 —cos2mx) dx = (3 -— sin2ns =. 
= Qe) a5 2 2m ieee 
c) 
e 1 7 1 1 is 
i cos? nde = 5 f (1 + cos 2nx) dx = =| x + —sin2nx =. 
_ pe 2 2n a 


Example 3 Let f € R[—a, a]. We shall show that 


2 fo f(x) dx, if f is an even function, 


0, if f is an odd function. 


: f(x) dx = | 
If f(—x) = f(x), then 


a 0 a 0 a 
f(x) dx = flxydr + | fesyde= | fennar+ f f(x) dx = 
a, ~a 0 a 0 
=[ fiat f fisydr= f (f(—x) + f(x) dx = 


=2 |" fiayar. 
0 


If f(—x) = —f (x), we obtain from the same computations that 


a 


fears [ (Fn + fen)ae= f Odx =0. 
4 0 0 


Example 4 Let f be a function defined on the entire real line R and having period T, 
thatis f(x +T) = f(x) forall x ER. 
If f is integrable on each finite closed interval, then for any a € R we have the 


equality 
a+T T 
[ fear= [i rover, 
a 0 
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that is, the integral of a periodic function over an interval whose length equals the 
period T of the function is independent of the location of the interval of integration 
on the real line: 


a+T 0 T a+T 
i: flax)dx = / f(x)dx + / f(x)dx + i flx)dx = 
a a 0 


T 


T 0 a 
4, fade + | pode [ f(@t+T)-1dt= 
0 a 0 


T 0 a T 
=) flxydx + | reser + f finar= f f(x) dx. 
0 a 0 0 


Here we have made the change of variable x = ¢ + T and used the periodicity of 
the function f(x). 


Example 5 Suppose we need to compute the integral ie sin(x*) dx, for example 
within 10-7. 

We know that the primitive f sin(x”) dx (the Fresnel integral) cannot be ex- 
pressed in terms of elementary functions, so that it is impossible to use the Newton— 
Leibniz formula here in the traditional sense. 

We take a different approach. When studying Taylor’s formula in differential 
calculus, we found as an example (see Example 11 of Sect. 5.3) that on the interval 
[—1, 1] the equality 


: 3 1 5 
sinx ¥x — 37 + 57 =: P(x) 
holds within 10-3. 
But if | sinx — P(x)| < 10~> on the interval [-1, 1], then | sin(x?) — P(x?)| < 
10-3 also, for 0 <x <1. 
Consequently, 


[ sin(x?) af P(x?) dx 


1 1 
=| |sin(x*) — P(x7)| dx <| 10 de 107°. 
0 0 


Thus, to compute the integral rh sin(x*) dx with the required precision, it suffices 


to compute the integral fi P(x?) dx. But 


1 1 1 1 
i P(s?)ar= f x2 ee ee dx = 
0 0 3! 5! 


1 1 1 
~_ x? xt 4 xl 1 1 1 
3! 317 5! 


= 0.310 + 1073, 
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and therefore 


1 
/ sin(x*) dx = 0.31042 x 10°? = 0.314 10°”. 
0 


Example 6 The quantity u = gy? J (x) dx is called the integral average value of 
the function on the closed nen val [a, b]. 

Let f be a function that is defined on R and integrable on any closed interval. 
We use f to construct the new function 


1 x+6 
naa sf : f@)dt, 


whose value at the point x is the integral average value of f in the 6-neighborhood 
of x. 

We shall show that F(x) (called the average of f) is, compared to f, more regu- 
lar. More precisely, if f is integrable on any interval [a, b], then F'5(x) is continuous 
on R, and if f € C(R), then F;(x) € C (R). 

We verify first that F(x) is continuous: 


x+d+h x—6d 
val pars f f(t) dt] < 
x x—d+h 


5 (Cal + Clhl) = Chl 


| Fs(x +h) — F5(x)| = 


oe Sic 


if | f(t)| < C, for example, in the 26-neighborhood of x and |h| < 6. It is obvious 
that this estimate implies the continuity of Fs(x). 
Now if f € C(R), then by the rule for differentiating a composite function 


d seo) d f? dy F 
af foa=Z pf roa-F = sew), 


so that from the expression 


1 x+6 1 x—5 
raya = f fina = | f(t) dt 


we find that 
f(x +6) — f(x — 4) 
26 , 


After the change of variable t = x + u in the integral, the function F(x) can be 
written as 


F.Os 


1 r) 
Fs(x) = a5 |, Fort wae. 
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If f € C(R), then, applying the first mean-value theorem, we find that 
F(x) = fd) B= fet), 
where |t| < 6. It follows that 
Pau F(6)(x) = f(), 


which is completely natural. 


6.3.6 Problems and Exercises 


1. Using the integral, find 
a) lim ool aEpz qe oe roma 


a at aoe a . 
ree, ifa>0. 


b) limy—+oo 


2. a) Show that any continuous function on an open interval has a primitive on that 
interval. 

b) Show that if f €¢ C‘[a, b], then f can be represented as the difference of 
two nondecreasing functions on [a, b] (see Problem 4 of Sect. 6.1). 


3. Show that if the function g is smooth, then the second mean-value theorem (The- 
orem 6 of Sect. 6.2) can be reduced to the first mean-value theorem through integra- 
tion by parts. 

4. Show that if f € C(R), then for any fixed closed interval [a, b], given e« > 0 one 
can choose 6 > 0 so that the inequality | s(x) — f(x)| < e holds on [a, b], where 
Fs is the average of the function studied in Example 6. 

5. Show that 


x" ef 1 2 
—dt~ se as x > +00. 
1 t x 


6. a) Verify that the function f(x) = / is a sin(t*) dt has the following representa- 
tion as x —> oo: 


f(x) = 


cos(x2)  cos(x + 1)” ( 1 ) 
O : 
2x 2(x + 1) x2 


b) Find lim, _, ., xf (x) and limy-+.0 xf (x). 


7. Show that if f : IR — R is a periodic function that is integrable on every closed 
interval [a, b] C R, then the function 


rays [ f(t)dt 


can be represented as the sum of a linear function and a periodic function. 
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8. a) Verify that for x > 1 andn €N the function 


1 a 
Py (x) = -| (x + Vx? — 1cos¢)" dp 
0 


is a polynomial of degree n (the nth Legendre polynomial). 
b) Show that 


P,(x) - | dw 
n\X) = . 
mt Jo (x—vx*—I1cosp)” 


9. Let f be a real-valued function defined on a closed interval [a,b] C R and 
&1,...,&m distinct points of this interval. The values of the Lagrange interpolat- 
ing polynomial of degree m — 1 


Lmn—-1(%) := > fe] — 
J l 


j=! iAj 


are equal to the values of the function at the points &,...,&,, (the nodes of the 
interpolation), and if f ¢ C’” [a, b], then 


1 
f(x) — Lm-1«) = ear a (¢(x))@m(x), 


where @,, (x) = [ey (x — &;) and f(x) € Ja, b[ (see Exercise 11 in Sect. 5.3). 
Let & = 244 + 946;; then 6 €[—-1, 1],i=1,...,m. 


a) Show that 


b NO 
| tmsenar="$* Va reo. 
7 i=l 


where 


In particular 


on) fy Lox) dx = (b — a) f ($4), if m = 1, 61 =0; 
an) f? Li(x)dx = 254 f(a) + fO)], ifm =2, 0) =—1, & =1; 
b-—a 


2 
a3) he La(x) dx = =|"[ f(a) + 4 f(2*) + fO)I, if m = 3, 0) =—-1, & =O, 
63 = 1. 


b) Assuming that f € Ca, b] and setting Mm = Maxyefa,b] | f™ (x)I, esti- 
mate the magnitude R,, of the absolute error in the formula 
b 
a 


b 
i ide i Ln Aten, (*) 
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and show that |Rn| < “* ? |om(x)| dx. 

c) In cases a), a2), and @3) formula (*) is called respectively the rectangular, 
trapezoidal, and parabolic rule. In the last case it is also called Simpson’s rule.® 
Show that the following formulas hold in cases a1), a2), and 3): 


Ri FON — a, R=) — a, 
_ FR) 5 
3=—~ R90 OO: 


where &1, &, 3 € [a, b] and the function f belongs to a suitable class Cla, bj. 
d) Let f be a polynomial P. What is the highest degree of polynomials P for 
which the rectangular, trapezoidal, and parabolic rules respectively are exact? 
Leth= ba x, =a t+ hk (k=0, 1,...,”), and yy = f (xx). 
e) Show that in the rectangular rule 


b 
/ f(x) dx =h(yo + yi +++- + yn-1) + Ri 
a 


the remainder has the form R; = f® (b — a)h, where & € [a, b]. 
f) Show that in the trapezoidal rule 


h 
/ f(x) dx = ZL + yn) +201 + y2 +++ + yn-1)] + Ro 


the remainder has the form R2 = — £@ (b — ayh?, where & € [a, b]. 
g) Show that in Simpson’ s rule (the parabolic rule) 


b 
h 
/ f(x) dx = rake ae Vad =e AM te Ya oF Ye) 
a 
+ 2(y2 + y4 +++: + yn—2)] + Rs, 
which can be written for even values of n, the remainder R3 has the form 


of) 
0 


3 b—a)h* 
ig (b—a)h’, 


where & € [a, b]. 
h) Starting from the relation 


[ dx 
w=4 —.,, 
9 1+x? 


87. Simpson (1710-1761) — British mathematician. 
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compute z within 10~3, using the rectangular, trapezoidal, and parabolic rules. Note 
carefully the efficiency of Simpson’s rule, which is, for that reason, the most widely 
used quadrature formula. (That is the name given to formulas for numerical inte- 
gration in the one-dimensional case, in which the integral is identified with the area 
of the corresponding curvilinear trapezoid.) 


10. By transforming formula (6.46), obtain the following forms for the remainder 
term in Taylor’s formula, where we have set h = x — a: 


n 1 = 
a) goto FO (at th) —1)""" dr; 
b) © fp fa — Avi) de. 


11. Show that the important formula (6.48) for change of variable in an integral 
remains valid without the assumption that the function in the substitution is mono- 
tonic. 


6.4 Some Applications of Integration 


There is a single pattern of ideas that often guides the use of integration in applica- 
tions; for that reason it is useful to expound this pattern once in its pure form. The 
first subsection of this section is devoted to that purpose. 


6.4.1 Additive Interval Functions and the Integral 


In discussing the additivity of the integral over intervals in Sect. 6.2 we introduced 
the concept of an additive (oriented) interval function. We recall that this is a func- 
tion (a, B) + I(a, B) that assigns a number J (a, 6) to each ordered pair of points 
(a, B) of a fixed closed interval [a,b], in such a way that the following equality 
holds for any triple of points a, 8, y € [a, b]: 


T(a,y) =I (a, B) + 1(B, y). (6.49) 


It follows from (6.49) when a = 8 = y that I(a,a@) = 0, while for a = y we 
find that I(a@, 6) + 1 (6, a) = 0, that is, 7(a, 8) = —1(B, a). This relation shows the 
effect of the order of the points a, 6. 

Setting 


F(x) =I(a,x), 
by the additivity of the function 7 we have 
T(a, B) =I (a, B) — I(a,a) = F(B) — F(a). 
Thus, every additive oriented interval function has the form 


(a, B) = F(B) — F(a), (6.50) 
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where x +> F(x) is a function of points on the interval [a, b]. 

It is easy to verify that the converse is also true, that is, any function x +> F(x) 
defined on [a,b] generates an additive (oriented) interval function by formula 
(6.50). 

We now give two typical examples. 


Example I If f € Ria, b], the function F(x) = i Ki f(t) dt generates via formula 
(6.50) the additive function 


B 
ra, p)= | f(t) dt. 


We remark that in this case the function F(x) is continuous on the closed interval 
[a, b]. 


Example 2 Suppose the interval [0, 1] is a weightless string with a bead of unit mass 
attached to the string at the point x = 1/2. 

Let F(x) be the amount of mass located in the closed interval [0, x] of the string. 
Then by hypothesis 


0 forx <1/2, 


F(x) = 
=) for i/2<x <1, 


The physical meaning of the additive function 
I(a, B) = F(B) - F@) 


for 8 > a@ is the amount of mass located in the half-open interval Ja, 6]. 

Since the function F is discontinuous, the additive function 7 (a, 6) in this case 
cannot be represented as the Riemann integral of a function — a mass density. (This 
density, that is, the limit of the ratio of the mass in an interval to the length of the 
interval, would have to be zero at any point of the interval [a, b] except the point 
x = 1/2, where it would have to be infinite.) 

We shall now prove a sufficient condition for an additive interval function to be 
generated by an integral, one that will be useful in what follows. 


Proposition 1 Suppose the additive function I (a, B) defined for points a, B of a 


closed interval [a, b] is such that there exists a function f € R[a, b] connected with 
Tas follows: the relation 


inf f(x)\(B-—a)</(a@,B)< sup f(x)(B-—a@) 
xe[a, 6] xéla,B] 
holds for any closed interval (a, B| such thata <a < B <b. Then 


b 
I(a, b) = f(x) dx. 
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Proof Let P be an arbitrary partition a = x9 <--- < x, = b of the closed interval 
[a, b], let m; = infye[x,;_,,x;) f(x), and let Mj = SUP x e[x;_1,x)] F(x). 
For each interval [x;—1, x;] of the partition P we have by hypothesis 


m; Ax; < I (xj-1, xi) < Mj Ax;. 


Summing these inequalities and using the additivity of the function /(q@, 6), we 
obtain 


n n 
So mj Ax; <I(a,b)< SoM; Ax;. 
i=l i=1 
The extreme terms in this last relation are familiar to us, being the upper and 
lower Darboux sums of the function f corresponding to the partition P of the closed 
interval [a, b]. As A(P) — 0 they both have the integral of f over the closed interval 
[a, b] as their limit. Thus, passing to the limit as A(P) — 0, we find that 


b 
I(a, b) =F f(x) dx. 


Let us now illustrate Proposition | in action. 


6.4.2 Arc Length 


Suppose a particle is moving in space R? and suppose its law of motion is known to 
be r(t) = (x(t), y(t), z(t)), where x(f), y(t), and z(t) are the rectangular Cartesian 
coordinates of the point at time f¢. 

We wish to define the length /[a, b] of the path traversed during the time interval 
a<t<b. 

Let us make some concepts more precise. 


Definition 1 A path in R? is a mapping t + (x(t), y(t), z(t)) of an interval of the 
real line into R? defined by functions x(t), y(t), z(t) that are continuous on the 


interval. 


Definition 2 If tt (x(t), y(t), z(t)) is a path for which the domain of the parame- 
ter ¢ is the closed interval [a, b] then the points 


A=(x@, y@,z(@)) and B= (x(d), y(b), z()) 
in R? are called the initial point and terminal point of the path. 


Definition 3 A path is closed if it has both an initial and terminal point, and these 
points coincide. 
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Definition 4 If  : J — R? is a path, the image (J) of the interval J in R? is 
called the support of the path. 

The support of an abstract path may turn out to be not at all what we would like 
to call a curve. There are examples of paths whose supports, for example, contain 
an entire three-dimensional cube (the so-called Peano “curves”). However, if the 
functions x(t), y(t), and z(t) are sufficiently regular (as happens, for example, in 
the case of a mechanical motion, when they are differentiable), we are guaranteed 
that nothing contrary to our intuition will occur, as one can verify rigorously. 


Definition 5 A path I : J > R? for which the mapping J > I(/) is one-to-one is 
called a simple path or parametrized curve, and its support is called a curve in R?. 


Definition 6 A closed path I : [a, b] + R? is called a simple closed path or simple 
closed curve if the path I : [a, b[ > R? is simple. 


Thus a simple path differs from an arbitrary path in that when moving over its 
support we do not return to points reached earlier, that is, we do not intersect our 
trajectory anywhere except possibly at the terminal point, when the simple path is 
closed. 


Definition 7 The path [ : J > R? is called a path of a given smoothness if the 
functions x(t), y(t), and z(t) have that smoothness. 
(For example, the smoothness C[a, b], Ca, b], or c# [a, b].) 


Definition 8 A path I : [a, b] — R? is piecewise smooth if the closed interval [a, b] 
can be partitioned into a finite number of closed intervals on each of which the 
corresponding restriction of the mapping I” is defined by continuously differentiable 
functions. 

It is smooth paths, that is, paths of class C‘ and piecewise smooth paths that we 
intend to study just now. 

Let us return to the original problem, which we can now state as the problem of 
defining the length of a smooth path I: [a, b] > R?. 

Our initial ideas about the length /[a, b] of the path traversed during the time 
interval a < t < # are as follows: First, if a < 6 < y, then 


I[a, y] =l[o, 6] +/[B, v], 


and second, if v(t) = (x(t), y(t), z(f)) is the velocity of the point at time r, then 


inf |v@)|(B —@) </la, B] < sup |v(t)|(6 — a). 
xe[a,B] xe[a,B] 


Thus, if the functions x(t), y(t), and z(t) are continuously differentiable on 
[a, b], by Proposition | we arrive in a deterministic manner at the formula 


b b 
ita. b1= f lvin|ar= f (20) +0 + 20d, (6.51) 
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which we now take as the definition of the length of a smooth path I : [a, b] > R°. 
If z(t) = 0, the support lies in a plane, and formula (6.51) assumes the form 


b 
ita, b1= | / x2(t) + $2(0) de. (6.52) 


Example 3 Let us test formula (6.52) on a familiar object. Suppose the point moves 
according to the law 


x = Rcos2nzt, 
(6.53) 
y= Rsin2zt. 


Over the time interval [0, 1] the point will traverse a circle of radius R, that is, a 
path of length 27 R if the length of a circle can be computed from this formula. 
Let us carry out the computation according to formula (6.52): 


1 
I[0, 1] = / V (—2m R sin 2mt)2 + (22 Roos 2zt)2 dt =27R. 
0 


Despite the encouraging agreement of the results, the reasoning just carried out 
contains some logical gaps that are worth paying attention to. 

The functions cosa@ and sina, if we use the high-school definition of them, are 
the Cartesian coordinates of the image p of the point po = (1, 0) under a rotation 
through angle a. 

Up to sign, the quantity @ is measured by the length of the arc of the circle 
x* + y* = 1 between po and p. Thus, in this approach to trigonometric functions 
their definition relies on the concept of the length of an arc of a circle and hence, in 
computing the circumference of a circle above, we were in a certain sense complet- 
ing a logical circle by giving the parametrization in the form (6.53). 

However, this difficulty, as we shall now see, is not fundamental, since a 
parametrization of the circle can be given without resorting to trigonometric func- 
tions at all. 

Let us consider the problem of computing the length of the graph of a function 
y = f (x) defined on a closed interval [a, b] C R. We have in mind the computation 
of the length of the path I" : [a, b] > R? having the special parametrization 


XR (x, f (x)), 
from which one can conclude that the mapping I" : [a,b] > R* is one-to-one. 


Hence, by Definition 5 the graph of a function is a curve in R?. 
In this case formula (6.52) can be simplified, since by setting x = t and y= f(t) 


in it, we obtain 
b 
I[a, b] =) V1l+[fOf az. (6.54) 
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In particular, if we consider the semicircle 
yeVlexs, =Lers1, 


of the circle x + y? = 1, we obtain for it 


7 —x 3 I dx 
l= 1 ———- | dx= ———-.. 6.55 
[ |S] [ 1— x? : 


But the integrand in this last integral is an unbounded function, and hence does 
not exist in the traditional sense we have studied. Does this mean that a semicircle 
has no length? For the time being it means only that this parametrization of the 
semicircle does not satisfy the condition that the functions x and y be continuous, 
under which formula (6.52), and hence also formula (6.54), was written. For that 
reason we must either consider broadening the concept of integral or passing to a 
parametrization satisfying the conditions under which (6.54) can be applied. 

We remark that if we consider this parametrization on any closed interval of the 
form [—1+6, 1 —6], where —1 < —1+6 <1—6 <1, then formula (6.54) applies 
on that interval, and we find the length 


1-6 dx 
t-1+5,1-a1= | —___. 
—146 V1 — x2 


for the arc of the circle lying above the closed interval [—1+ 6, 1 — 6]. 

It is therefore natural to consider that the length / of the semicircle is the limit 
lims_, +9 /[—1-+.6, 1 — 6]. One can interpret the integral in (6.55) in the same sense. 
We shall study this naturally arising extension of the concept of a Riemann integral 
in the next section. 

As for the particular problem we are studying, without even changing the 
parametrization one can find, for example, the length I-35, 51 of an arc of the unit 
circle subtended by a chord congruent to the radius of the circle. Then (from geo- 
metric considerations alone) it must be that / = 3 - /[— 5 5]. 

We remark also that 


dx (1 — x? + x?) dx =[vi ra 1 fxd —x?) 
— x2 dx — 
Iss / V1 — x2 2 V1 — x2 
=2 f Jiqsax- 2/1= 2, 


and therefore 


[-1+8,1-8]=2f Viaa2ae (V1 = x2)['75 


—1+6 


Thus, 


l= lim 7[-1+6,1-—6]= af V1l—x?dx. 
56—>+0 
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The length of a semicircle of unit radius is denoted z, and we thus arrive at the 
following formula 


1 
n=2f V1l—x2dx. 
-1 


This last integral is an ordinary (not generalized) Riemann integral and can be 
computed with any precision. 

If for x € [—1, 1] we define arccos x as /[x, 1], then by the computations carried 
out above 


1 
arccosx = ——., 
i V1—t2 


or 


1 
arccosx = xv 1 = #42] V1—tdt. 
x 


If we regard arc length as a primitive concept, then we must also regard the 
function x +> arccos x introduced just now and the function x +> arcsinx, which can 
be introduced similarly, as primitive. But the functions x + cosx and x +> sinx can 
then be obtained as the inverses of these functions on the corresponding intervals. 
In essence, this is what is done in elementary geometry. 

The example of the length of a semicircle is instructive not only because while 
studying it we made a remark on the definition of the trigonometric functions that 
may be of use to someone, but also because it naturally raises the question whether 
the number defined by formula (6.51) depends on the coordinate system x, y, z and 
the parametrization of the curve when one is finding the length of a curve. 

Leaving to the reader the analysis of the role played by three-dimensional Carte- 
sian coordinates, we shall examine here the role of the parametrization. 

We need to clarify that by a parametrization of a curve in R*, we mean a defini- 
tion of a simple path I : J > R* whose support is that curve. The point or number 
t € J is called a parameter and the interval J the domain of the parameter. 

If 7:1 — Land I’: I > CL are two one-to-one maepines with the same rage of 
values L, there naturally arise one-to-one mappings "~! ol: I > Tandr-'oP: 
I — I between the domains / and J of these mappings. 

In particular, if there are two parametrizations of the same curve, then there is 
a natural correspondence between the parameters ¢ € J and t € I,t=t(t)ort= 
T(t), making it possible, knowing the parameter of a point in one parametrization, 
to find its parameter in the other parametrization. 

Let I: [a,b] > £ and r: [a, B] =, L be two parametrizations of the same 
curve with the correspondences I"(a) = ‘- (a) and '(b) = ie (6) between their ini- 
tial and terminal points. Then the transition functions t = t(t) and t = t(t) from 
one parameter to another will be continuous, strictly monotonic mappings of the 
closed intervals a < t <b and a@ <t < 6 onto each other with the initial points and 
terminal points corresponding: a<a,b< 6. 

Here, if the curves I” and I are defined by the triples (x(t), y(t), z(t)) and 
(X(t), ¥(t), Z(t)) of smooth functions such that |v(t)|? = %7(t) + 97() + 22(1) £0 
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on [a, b] and ¥(t)|* = (tr) + (rz) + 2(r) # 0 on [a, B], then one can verify 
that in this case the transition functions tf = f(t) and t = T(t) are smooth functions 
having positive derivatives on the intervals on which they are defined. 

We shall not undertake to verify this assertion here; it will eventually be obtained 
as a corollary of the important implicit function theorem. At the moment this asser- 
tion is mostly intended as motivation for the following definition. 


Definition 9 The path P [a, 6] > R? is obtained from I : [a, b] > R?* by an 
admissible change of parameter if there exists a smooth mapping T : [a, 8B] — [a, b] 
such that T(~) = a, T(B) =b, T'(t) > Oon[a, B] and =L oT. 


We now prove a general proposition. 


Proposition 2 [f a smooth path I : [a, B] > R3 is obtained from a smooth path 
I’: [a,b] > R? by an admissible change of parameter, then the lengths of the two 
paths are equal. 


Proof Let I : [a, B] > R3 and I: [a,b] > R® be defined respectively by the 
triples of smooth functions t +> (x(t), y(t), Z(t)) and t B® (x(f), y(t), z(4)), and 
let t = t(t) be the admissible change of parameter under which 


&(t) =x(t(t)), H(t) = y(t(t)), Z(t) = z(t(t)). 


Using the definition (6.51) of path length, the rule for differentiating a composite 
function, and the rule for change of variable in an integral, we have 


b 
| (20+%0+204- 


B 
af J2(1@) + 32(t(r)) + 22(t(r))t'(t) dt = 


B 
=) VEC@@P + b@)r'@F + E¢@)'@F at = 


Bor : ; 
“ / (#2) + 320) + 2@ar. 


Thus, in particular, we have shown that the length of a curve is independent of a 
smooth parametrization of it. 

The length of a piecewise smooth path is defined as the sum of the lengths of the 
smooth paths into which it can be divided; for that reason it is easy to verify that the 
length of a piecewise smooth path also does not change under an admissible change 
of its parameter. 

To conclude the discussion of the concept of the length of a path and the length 
of a curve (which, after Proposition 2, we now have the right to talk about), we 
consider another example. 
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Fig. 6.2 ; B 


Example 4 Let us find the length of the ellipse defined by the canonical equation 


y 


5=1 (@>b>0). (6.56) 


oie 
No 
> 


Taking the parametrization x =asinw, y=bcosy, 0 < w < 27, we obtain 


20 20 
i= [ _ bsiny)? dy = [ ja - — b) ) sin? wdy = 


=4a ee sin? y dy = 4a fo 1 —k? sin? w dy, 


2°, i ; 
where k* = 1 — e is the square of the eccentricity of the ellipse. 


The integral 
yg 
E(k, o=| 1—k? sin? y dw 
0 


cannot be expressed in elementary functions, and is called an elliptic integral be- 
cause of the connection with the ellipse just discussed. More precisely, E(k, g) is 
the elliptic integral of second kind in the Legendre form. The value that it assumes 
for g = 2/2 depends only on k, is denoted E(k), and is called the complete elliptic 
integral of second kind. Thus E(k) = E(k, 1/2), so that the length of an ellipse has 
the form / = 4a E(k) in this notation. 


6.4.3 The Area of a Curvilinear Trapezoid 


Consider the figure aA Bb of Fig. 6.2, which is called a curvilinear trapezoid. This 
figure is bounded by the vertical line segments aA and bB, the closed interval [a, b] 
on the x-axis, and the curve AB, which is the graph of an integrable function y = 
f(x) on [a, b]. 

Let [a, 6] be a closed interval contained in [a, b]. We denote by S(a, 8) the area 
of the curvilinear trapezoid af (a) f(B)6 corresponding to it. 
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Our ideas about area are as follows: if a <a < B < y <b, then 


S(a, vy) = S(@, B) + S(B, y) 


(additivity of areas) and 


inf f(x)(B—a) <S(a,B)< sup f(x)(B-«). 
xe[a,p] xe[a,B] 


(The area of an enclosing figure is not less than the area of the figure enclosed.) 
Hence by Proposition |, the area of this figure must be computed from the for- 
mula 


b 
s(a,b)= f f(x) dx. (6.57) 


Example 5 Let us use formula (6.57) to compute the area of the ellipse given by the 
canonical equation (6.56). 

By the symmetry of the figure and the assumed additivity of areas, it suffices to 
find the area of just the part of the ellipse in the first quadrant, then quadruple the 
result. Here are the computations: 


a 2 m/2 
saa jwa(i- >) ax =a [ / = Sae neon 
0 a 0 
m/2 m/2 


= 4ab | cos*rdr = 2ab f (1 — cos 2t) dt = xab. 
0 0 


Along the way we have made the change of variable x =asint,0<t<7/2. 
Thus §S = zab. In particular, when a = b = R, we obtain the formula xR? for 
the area of a disk of radius R. 


Remark It should be noted that formula (6.57) gives the area of the curvilinear trape- 
zoid under the condition that f(x) > 0 on [a,b]. If f is an arbitrary integrable 
function, then the integral (6.57) obviously gives the algebraic sum of the areas of 
corresponding curvilinear trapezoids lying above and below the x-axis. When this 
is done, the areas of trapezoids lying above the x-axis are summed with a positive 
sign and those below with a negative sign. 


6.4.4 Volume of a Solid of Revolution 


Now suppose the curvilinear trapezoid shown in Fig. 6.2 is revolved about the closed 
interval [a, b]. Let us determine the volume of the solid that results. 

We denote by V(qa, f) the volume of the solid obtained by revolving the curvi- 
linear trapezoid af (a) f(B)B (see Fig. 6.2) corresponding to the closed interval 
[o, B] C [a, 5]. 
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According to our ideas about volume the following relations must hold: if a < 
a<B<y<b,then 


Va@,y)=V@, B)+ VB, y) 


and 
2 2 
(int, £0) (Ba) <Va.B)<x( sup f(a)) (Ba), 
XELA, xELQ, 


In this last relation we have estimated the volume V(a, 8) by the volumes of 
inscribed and circumscribed cylinders and used the formula for the volume of a 
cylinder (which is not difficult to obtain, once the area of a disk has been found). 

Then by Proposition | 


b 
viab)=n f f? (x) dx. (6.58) 


Example 6 By revolving about the x-axis the semicircle bounded by the closed in- 
terval [—R, R] of the axis and the arc of the circle y = V R2 — x2, —R <x < R, one 
can obtain a three-dimensional ball of radius R whose volume is easily computed 
from (6.58): 


7 4 
aa (R? — x?) dx = <7 R?. 
_R 3 


More details on the measurement of lengths, areas, and volumes will be given in 
Part 2 of this course. At that time we shall solve the problem of the invariance of the 
definitions we have given. 


6.4.5 Work and Energy 


The energy expenditure connected with the movement of a body under the influence 
of a constant force in the direction in which the force acts is measured by the product 
F- S of the magnitude of the force and the magnitude of the displacement. This 
quantity is called the work done by the force in the displacement. In general the 
directions of the force and displacement may be noncollinear (for example, when 
we pull a sled by a rope), and then the work is defined as the inner product (F, S) of 
the force vector and the displacement vector. 

Let us consider some examples of the computation of work and the use of the 
related concept of energy. 


Example 7 The work that must be performed against the force of gravity to lift a 
body of mass m vertically from height h; above the surface of the Earth to height 
hz is, by the definition just given, mg (hz —h,). Itis assumed that the entire operation 
occurs near the surface of the Earth, so that the variation of the gravitational force 
mg can be neglected. The general case is studied in Example 10. 
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Example 8 Suppose we have a perfectly elastic spring, one end of which is attached 
at the point 0 of the real line, while the other is at the point x. It is known that the 
force necessary to hold this end of the spring is kx, where k is the modulus of the 
spring. 

Let us compute the work that must be done to move the free end of the spring 
from position x =a tox =b. 

Regarding the work A(q, 8) as an additive function of the interval [a, 6] and 
assuming valid the estimates 


Att Oe OS ate A) = sup (kx) 2), 


xe[a,B] 


we atrive via Proposition | at the conclusion that 


2 \b 


b kx 
A(a, b)= kx dx = — 
- 2 


a 


This work is done against the force. The work done by the spring during the same 
displacement differs only in sign. 

The function U(x) = a that we have found enables us to compute the work 
we do in changing the state of the spring, and hence the work that the spring must 
do in returning to its initial state. Such a function U(x), which depends only on the 
configuration of the system, is called the potential energy of the system. It is clear 
from the construction that the derivative of the potential energy gives the force of 
the spring with the opposite sign. 

If a point of mass m moves along the axis subject to this elastic force F = —kx, 
its coordinate x(t) as a function of time satisfies the equation 


mi = —kx. (6.59) 


We have already verified once (see Sect. 5.6.6) that the quantity 


mv-— kx? 


which is the sum of the kinetic and (as we now understand) potential energies of the 
system, remains constant during the motion. 


Example 9 We now consider another example. In this example we shall encounter a 
number of concepts that we have introduced and become familiar with in differential 
and integral calculus. 

We begin by remarking that by analogy with the function (6.60), which was writ- 
ten for a particular mechanical system satisfying Eq. (6.59), one can verify that for 
an arbitrary equation of the form 


5(t) = f(s@), (6.61) 
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Fig. 6.3 


FT 
0 
where f(s) is a given function, the sum 
ge 
. +U(s)=E (6.62) 
does not vary over time if U’(s) = — f(s). 
Indeed, 
dE ids? dUG) 2... dU ds 2», 
goo @ oe ae 


Thus by (6.62), regarding E as a constant, we obtain successively, first 


§=+,/2(E — U(s)) 


(where the sign must correspond to the sign of the derivative ), then 


dt 1 


ds J2(E—UG))’ 


and finally 


t=c,2i 


/ ds 
V2(E — U(s)) 

Consequently, using the law of conservation of the “energy” (6.62) in Eq. (6.61), 
we have succeeded theoretically in solving this equation by finding not the function 
s(t), but its inverse f(s). 

Equation (6.61) arises, for example, in describing the motion of a point along a 
given curve. Suppose a particle moves under the influence of the force of gravity 
along a narrow ideally smooth track (Fig. 6.3). 

Let s(t) be the distance along the track (that is, the length of the path) from a 
fixed point O — the origin of the measurement — to the point where the particle is at 
time ft. It is clear that then s(f) is the magnitude of the velocity of the particle and 
S(t) is the magnitude of the tangential component of its acceleration, which must 
equal the magnitude of the tangential component of the force of gravity at a given 
point of the track. It is also clear that the tangential component of the force of gravity 
depends only on the point of the track, that is, it depends only on s, since s can be 
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Fig. 6.4 


—Yo 


—So SO 
0 s(t) 


regarded as a parameter that parametrizes the curve’ with which we are identifying 
the track. If we denote this component of the force of gravity by f(s), we find that 


ms = f(s). 


For this equation the following quantity will be preserved: 
1 
ra +U(s)=E, 


where U’(s) = — f (s). 

Since the term smi? is the kinetic energy of the point and the motion along the 
track is frictionless, we can guess, avoiding calculations, that the function U(s), 
up to a constant term, must have the form mgh(s), where mgh(s) is the potential 
energy of a point of height /(s) in the gravitational field. 

If the relations s(0) = 0, s(0) = sg, and h(sg) = ho held at the initial time t = 0, 
then by the relations 


2E oy 
— =s°4+2gh(s)=C 
m 
we find that C = 2gh(so), and therefore v= 2g(ho — h(s)) and 


: ds 
t= ————————_, (6.63) 
so V28(ho — h(s)) 

In particular if, as in the case of a pendulum, the point moves along a circle of 
radius R, the length s is measured from the lowest point O of the circle, and the 
initial conditions amount to the equality s(0) = 0 at t = 0 and a given initial angle 
of displacement —gp (see Fig. 6.4). Then, as one can verify, expressing s and h(s) 
in terms of the angle of displacement ¢g, we obtain 


pe: [ ds = [ Rdy 
ds V2¢ho = hs) Sy V2 R(Cos fh — COs G0)" 


°The parametrization of a curve by its own arc length is called its natural parametrization, and s is 
called the natural parameter. 
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= ef aid (6.64) 
2V 8 Jw (sin?  — sin” vy 


Thus for a half-period 5T of oscillation of the pendulum we obtain 


rat ley” (6.65) 
(sin? @ — sin? ¥ © — sin 2" 


(sin? © — sin? & 


or 


: ue sin(y/2) 
from which, after the substitution STC ae sin6, we find 


T=4 Se oe (6.66) 
Vado Vi—ksin2a- 
where k? = sin 


We recall that the function 


~ do 
F(k,g)= | ~———— 
0 V1—k2sin20 


is called an elliptic integral of first kind in the Legendre form. For g = 7/2 it de- 
pends only on k?, is denoted K (k), and is called the complete elliptic integral of first 
kind. Thus, the period of oscillation of the pendulum is 


R 
T=4 B K(k). (6.67) 


If the initial displacement angle go is small, we can set k = 0, and then we obtain 


the approximate formula 
[R 
TX 20 /—. (6.68) 
& 


Now that formula (6.66) has been obtained it is still necessary to examine the 
whole chain of reasoning. When we do, we notice that the integrands in the inte- 
grals (6.63)-(6.65) are unbounded functions on the interval of integration. We en- 
countered a similar difficulty in studying the length of a curve, so that we have an 
approximate idea of how to interpret the integrals (6.63)—(6.65). 

However, given that this problem has arisen for the second time, we should study 
it in a precise mathematical formulation, as will be done in the next section. 


2 go 
7° 


Example 10 A body of mass m rises above the surface of the Earth along the trajec- 
tory th (x(t), y(t), z(t)), where ¢ is time, a < t < b, and x, y, z are the Cartesian 
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coordinates of the point in space. It is required to compute the work of the body 
against the force of gravity during the time interval [a, b]. 

The work A(q, f) is an additive function of the time interval [a, 6] C [a, b]. 

A constant force F acting on a body moving with constant velocity v performs 
work (F, vh) = (F, v)h in time h, and so the estimate 


inf (F(p@)), v(t))(B — @) < A(@, B) < sup (F(p@)), vO)(6 — a) 


f 
te[a,B] tela, B] 


seems natural, where v(t) is the velocity of the body at time f, p(t) is the point in 
space where the body is located at time t, and F(p(t)) is the force acting on the 
body at the point p = p(t). 

If the function (F(p(t)), v(t)) happens to be integrable, then by Proposition 1 we 
must conclude that 


b 
Aca.b)= [ (F(p(t)), v(t) de. 


In the present case v(t) = (x(t), y(t), z(t)), and if r(t) = (x(t), y(t), z(t)), then 
by the law of universal gravitation, we find 


F(p) = F( ) au. GmM 
=F(x,y,z)= = 
e Ire G2 +y2+2 


372 (,y,Z), 


where M is the mass of the Earth and its center is taken as the origin of the coordi- 
nate system. 
Then, 


x(t)x(t) tyOy@O) +220) 
CeCe? at) faa) ae 


(F, v)(t) =GmM 


and therefore 


b 1 > @O+YO+2O0Y . 
/ ae ome | +r +2O)P 


_ GmM fo GmM a 
H+yO+22())/2), IO a’ 
Thus 
AG.) = GmM _ Gm 
Ir(b)| — |r(a)| 


We have discovered that the work we were seeking depends only on the magni- 
tudes |r(a)| and |r(b)| of the distance of the body of mass m from the center of the 
Earth at the initial and final instants of time in the interval [a, b]. 

Setting 


GM 
U(r) =~ 9 
r 
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Fig. 6.5 F 


we find that the work done against gravity in displacing the mass m from any point 
of a sphere of radius rp to any point of a sphere of radius r; is computed by the 
formula 


Aror, = m(U (ro) — U(r1)). 


The function U(r) is called a Newtonian potential. If we denote the radius of the 


Earth by R, then, since GM = g, we can rewrite U(r) as 
R2 
tHe 
r 


Taking this into account, we can obtain the following expression for the work 
needed to escape from the Earth’s gravitational field, more precisely, to move a 
body of mass m from the surface of the Earth to an infinite distance from the center 
of the Earth. It is natural to take that quantity to be the limit lim,_,+4.0 Ar;. 

Thus the escape work is 


; . gk? gk? 
A=ArRo= lim Ar-= lim m| — — — ]=mgR. 
r—+00 r—>+00 R r 


6.4.6 Problems and Exercises 


1. Figure 6.5 shows the graph of the dependence F = F(x) of a force acting along 
the x-axis on a test particle located at the point x on the axis. 


a) Sketch the potential for this force in the same coordinates. 

b) Describe the potential of the force — F(x). 

c) Investigate to determine which of these two cases is such that the position xo 
of the particle is a stable equilibrium position and what property of the potential is 
involved in stability. 


2. Based on the result of Example 10, compute the velocity a body must have in 
order to escape from the gravitational field of the Earth (the escape velocity for the 
Earth). 
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Fig. 6.6 
y 
| 
| 
| 
0 & 
3. On the basis of Example 9 
a) derive the equation Rg = —g sing for the oscillations of a mathematical pen- 
dulum; 
b) assuming the oscillations are small, obtain an approximate solution of this 
equation; 


c) from the approximate solution, determine the period of oscillation of the pen- 
dulum and compare the result with formula (6.68). 


4. A wheel of radius r rolls without slipping over a horizontal plane at a uniform 
velocity v. Suppose at time ¢ = 0 the uppermost point A of the wheel has coordi- 
nates (0, 2r) in a Cartesian coordinate system whose x-axis lies in the plane and is 
directed along the velocity vector. 


a) Write the law of motion tb (x(t), y(t)) of the point A. 

b) Find the velocity of A as a function of time. 

c) Describe graphically the trajectory of A. (This curve is called a cycloid.) 

d) Find the length of one arch of the cycloid (the length of one period of this 
periodic curve). 

e) The cycloid has a number of interesting properties, one of which, discovered 
by Huygens!” is that the period of oscillation of a cycloidal pendulum (a ball rolling 
in a cycloidal well) is independent of the height to which it rises above the lowest 
point of the well. Try to prove this, using Example 9. (See also Problem 6 of the 
next section, which is devoted to improper integrals.) 


5. a) Starting from Fig. 6.6, explain why, if y= f(x) and x = g(y) are a pair of 
mutually inverse continuous nonnegative functions equal to 0 at x = 0 and y=0 
respectively, then the inequality 


y 


xy< is finart [gina 
0 0 


must hold. 


10Ch. Huygens (1629-1695) — Dutch engineer, physicist, mathematician, and astronomer. 
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b) Obtain Young’s inequality 


1 1 
S95 a? 
P q 
from a) forx,y >0, p,q >0,5+7=1. 
c) What geometric meaning does equality have in the inequalities of a) and b)? 


6. The Buffon needle problem.'! The number x can be computed in the following 
rather surprising way. 

We take a large sheet of paper, ruled into parallel lines a distance h apart and we 
toss a needle of length / < h at random onto it. Suppose we have thrown the needle 
N times, and on n of those times the needle landed across one of the lines. If N 
is sufficiently large, then 7 ~ ae where p = 3 is the approximate probability that 
the needle will land across a line. 

Starting from geometric considerations connected with the computation of area, 
try to give a satisfactory explanation of this method of computing zr. 


6.5 Improper Integrals 


In the preceding section we encountered the need for a somewhat broader concept of 
the Riemann integral. There, in studying a particular problem, we formed an idea of 
the direction in which this should be done. The present section is devoted to carrying 
out those ideas. 


6.5.1 Definition, Examples, and Basic Properties of Improper 
Integrals 


Definition 1 Suppose the function x +> f(x) is defined on the interval [a, +oo[ 
and integrable on every closed interval [a, b] contained in that interval. 


The quantity 
+00 


b 
f(x)dx:= lim / S (x) dx, 
7 b>+00 Ja 
if this limit exists, is called the improper Riemann integral or the improper integral 
of the function f over the interval [a, +00[. 

The expression [| — Ff (x) dx itself is also called an improper integral, and in that 
case we say that the integral converges if the limit exists and diverges otherwise. 
Thus the question of the convergence of an improper integral is equivalent to the 
question whether the improper integral is defined or not. 


'J.L.L. Buffon (1707-1788) — French experimental scientist. 
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Integration 


Example I Let us investigate the values of the parameter a for which the improper 


integral 


TO dy 


converges, or what is the same, is defined. 
Since 


> dx i fora #1, 


the limit 


exists only fora > 1. 
Thus, 


© dx 1 . 
== r ifa > 1, 
1 xX 


a-— 


(6.69) 


and for other values of the parameter a the integral (6.69) diverges, that is, is not 


defined. 


Definition 2 Suppose the function x + f(x) is defined on the interval [a, B[ and 


integrable on any closed interval [a, b] C [a, B[. The quantity 


B b 
/ f(x) dx := lim | f(x) dx, 


if this limit exists, is called the improper integral of f over the interval [a, B[. 


The essence of this definition is that in any neighborhood of B the function f 


may happen to be unbounded. 


Similarly, if a function x + f(x) is defined on the interval ]A, b] and integrable 


on every closed interval [a, b] C ]A, b], then by definition we set 


b b 
[fear tim f f(x) dx 


and also by definition we set 


b b 
/ f(x) dx i= tim, | f(x) dx. 
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Example 2 Let us investigate the values of the parameter a for which the integral 


! ax 
0 x* 


(6.70) 


converges. 
Since for a € ]0, 1] 


[S- pet, ife Al, 
ae line, ifa=1, 
it follows that the limit 

ay 1 


lim = 
a>+0Jq x® l-a 


exists only fora < 1. 
Thus the integral (6.70) is defined only for a < 1. 


Example 3 


0 0 
: ; : 0 . 
/ e‘dx= lim et dx = tim (e* i) = lim (1- e“) =1. 


Lae a>-o Jaq a>—oo 


Since the question of the convergence of an improper integral is answered in the 
same way for both integrals over an infinite interval and functions unbounded near 
one of the endpoints of a finite interval of integration, we shall study both of these 
cases together from now on, introducing the following basic definition. 


Definition 3 Let [a, w[ be a finite or infinite interval and x f(x) a function de- 
fined on that interval and integrable over every closed interval [a, b] C [a, w[. Then 
by definition 


(a) b 
/ f(x) dx := lim ij f(x) dx, (6.71) 
a b>o Ja 
if this limit exists as b > w, b € [a, of. 


From now on, unless otherwise stated, when studying the improper integral 
(6.71) we shall assume that the integrand satisfies the hypotheses of Definition 3. 

Moreover, for the sake of definiteness we shall assume that the singularity (“im- 
propriety”) of the integral arises from only the upper limit of integration. The study 
of the case when it arises from the lower limit is carried out word for word in exactly 
the same way. 

From Definition 3, properties of the integral, and properties of the limit, one can 
draw the following conclusions about properties of an improper integral. 
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Proposition 1 Suppose x > f(x) and x — g(x) are functions defined on an in- 
terval [a, w[ and integrable on every closed interval [a,b] C [a, w[. Suppose the 


improper integrals 


/ : f(x) dx, (6.72) 


/ g(x) dx (6.73) 


are defined. 

Then a) ifw € Rand f € Ra, w], the values of the integral (6.72) are the same, 
whether it is interpreted as a proper or an improper integral; 

b) for any 1, 42 € R the function (A; f + 428) (x) is integrable in the improper 
sense on [a, w[ and the following equality holds: 


[ cre meemar=a f Flx)de +22 f eid: 
c) ifc €[a, w[, then 


[ rear= | flaydr + | Pore 


d) if g: [a, yl >[a, @[ is a smooth strictly monotonic mapping with g(a) =a 
and ~(B) > was B > y, B € [a, y[, then the improper integral of the function 
tr (f og)(t\¢'(t) over [a, y[ exists and the following equality holds: 


w y 
[ tears f romme'ma. 
a a 
Proof Part a) follows from the continuity of the function 


b 
F(b) = f(x) dx 


on the closed interval [a, w] on which f € Ra, a]. 
Part b) follows from the fact that for b € [a, a[ 


b b b 
/ Gift rogyanrde =a f fla)dr +a f g(x) dx. 


Part c) follows from the equality 


b Cc b 
[ feou=/ fe)de + f fax 


which holds for all b, c € [a, w[. 
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Part d) follows from the formula for change of variable in the definite integral: 


b=9(B) B 
/ fo\dee / (f opty (t)dr. 


=9(@) 


Remark 1 To the properties of the improper integral expressed in Proposition | we 
should add the very useful rule for integration by parts in an improper integral, 
which we give in the following formulation: 


If f.g € Ca, w[ and the limit lim en (f - g)(x) exists, then the functions 
xX€Eld,| 


fg’ and f'- g are either both integrable or both nonintegrable in the improper 
sense on [a, w[, and when they are integrable the following equality holds: 


[ur ANGER oor -f( (F’ 8) (dx, 


where 


(f-g)@)|?= lim (f-g)(®)-(f-8)@). 


x>O 
x€[a,o[ 


Proof This follows from the formula 


[e@ we =—-o/° - fC fi-g)(x)dx 


a 


for integration by parts in a proper integral. 


Remark 2 It is clear from part c) of Proposition | that the improper integrals 


[tou and [tox 


either both converge or both diverge. Thus, in improper integrals, as in series, con- 
vergence is independent of any initial piece of the series or integral. 

For that reason, when posing the question of convergence of an improper integral, 
we sometimes omit entirely the limit of integration at which the integral does not 
have a singularity. 

With that convention the results obtained in Examples | and 2 can be rewritten 
as follows: 


the integral [ = & converges only for a > 1; 
the integral /’ i Se oorwenes only for a < 1. 


The sign +0 in the last integral shows that the region of integration is contained 
inx > 0. 
By achange of variable in this last integral, we immediately find that the integral 


dx 
xo+0 Geax)? COnVerges only for a < 1. 
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6.5.2 Convergence of an Improper Integral 


a. The Cauchy Criterion 


By Definition 3, the convergence of the improper integral (6.71) is equivalent to the 
existence of a limit for the function 


b 
F(b)= i S (x) dx (6.74) 


asb—>a,beé[a,of. 
This relation is the reason why the following proposition holds. 


Proposition 2 (Cauchy criterion for convergence of an improper integral) [f the 
function x +> f (x) is defined on the interval [a, w[ and integrable on every closed 
interval [a, b] C [a, |, then the integral i f (x) dx converges if and only if for 
every € > 0 there exists B € [a, w[ such that the relation 


b2 
if f(x) dx 
by 


holds for any b,, b2 € [a, o[ satisfying B < by and B < bo. 


<€é 


Proof As a matter of fact, we have 


by by bi 
i fsydr= [ flx)dx — | f(x) dx = F(b2) — F(b1), 
1 a a 


and therefore the condition is simply the Cauchy criterion for the existence of a limit 
for the function F(b) asb > w, b € [a, of. 


b. Absolute Convergence of an Improper Integral 


Definition 4 The improper integral ipa J (x) dx converges absolutely if the integral 
J \f \(@) dx converges. 
Because of the inequality 


by 
f(x) dx 
by 


= 


bg 
i | f(x) dx 
b 


1 


and Proposition 2, we can conclude that if an integral converges absolutely, then it 
converges. 

The study of absolute convergence reduces to the study of convergence of inte- 
grals of nonnegative functions. But in this case we have the following proposition. 
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Proposition 3 [fa function f satisfies the hypotheses of Definition 3 and f (x) => 0 
n [a, w[, then the improper integral (6.71) exists if and only if the function (6.74) 
is bounded on [a, of. 


Proof Indeed, if f(x) => 0 on [a, w[, then the function (6.74) is nondecreasing on 
[a, [, and therefore it has a limit as b > w, b € [a, o[, if and only if itis bounded. 


As an example of the use of this proposition, we consider the following corollary 
of it. 


Corollary 1 (Integral test for convergence of a series) If the function x + f(x) 
is defined on the interval [1, +00[, nonnegative, nonincreasing, and integrable on 
each closed interval [1, b] C [1, +00, then the series 


VfM=fY+ f+ 


n=1 


and the integral 
+00 


f(x) dx 
either both converge or both diverge. 


Proof It follows from the hypotheses that the inequalities 


n+l 
fa4+)< / F(x) dx < fn) 


hold for any n € N. After summing these inequalities, we obtain 


k k+l k 
Y fat )< / f(x)dx < * f(n) 


1 


n=1 n=1 


or 
sti - fs F&+D ss, 


where s; = y 4 f(@) and F(b) = i Ff (x) dx. Since sx and F(b) are nondecreas- 
ing functions of their arguments, these inequalities prove the proposition. 


In particular, one can say that the result of Example | is equivalent to the assertion 
that the series 


converges only fora > 1. 
The most frequently used corollary of Proposition 3 is the following theorem. 
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Theorem 1 (Comparison theorem) Suppose the functions x — f(x) and x > g(x) 
are defined on the interval [a, w[ and integrable on any closed interval [a,b] C 
[a, ol. 


If 
O< f@) <8) 


on [a, w[, then convergence of the integral (6.73) implies convergence of (6.72) and 


the inequality 
[ teas [ea 


holds. Divergence of the integral (6.72) implies divergence of (6.73). 


Proof From the hypotheses of the theorem and the inequalities for proper Riemann 
integrals we have 


b b 
Fib)= | flx)de s | gar) 


for any b € [a, w[. Since both functions F and G are nondecreasing on [a, w[, the 
theorem follows from this inequality and Proposition 3. 


Remark 3 If instead of satisfying the inequalities 0 < f(x) < g(x) the functions f 
and g in the theorem are known to be nonnegative and of the same order as x > o, 
x €[a, w[, that is, there are positive constants c; and c2 such that 


cif (x) < gg) <of(x), 


then by the linearity of the improper integral and theorem just proved, in this case 
we can conclude that the integrals (6.72) and (6.73) either both converge or both 
diverge. 


Example 4 The integral 


converges, since 


asx > +o. 


Example 5 The integral 
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converges absolutely, since 


for x > 1. Consequently, 


i 
1 


COS X 
2 


Xx 


cos x oe too 4 
5 dx| < X< = dx =1. 
x 1 1 Xx 


Example 6 The integral 


converges, since e 


—x2 


+00, 
i e* dx 
1 


<e~ for x > 1 and 


+00 > +00 | 
/ e* ax< f e*dx=-. 
1 1 e 


Example 7 The integral 


diverges, since 


for sufficiently large 


values of x. 


Example § The Euler integral 


converges, since 


as x > +0. 


m/2 
i Insin x dx 
0 


. 1 
|Insinx| ~ |Inx| < — 


AX 


Example 9 The elliptic integral 


a dx 
0 JU —x2)(1 — k2x2) 
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converges for 0 < k? <1, since 


vu — x?)(1 — k?x?) ~ V2(1 = k)(1 — x)! 


asx —> 1-0. 
Example 10 The integral 


i dé 
0 J/cosé —cos@ 


converges, since 


7 -—6 
cos@ = cong = [sin 2* sin © 5 ~ J/sing(y — 6)!/? 


2 


as 9 > po —0. 


Example 11 The integral 


L %? dy 
T=2 (6.75) 
8/0 | /sin? ® sin’ $ 
converges for 0 < gp < 7 since as yy —> go — 0 we have 
sin? a — sin? o ~ J/singn(gy — ¥)!/?. (6.76) 


Relation (6.75) expresses the dependence of the period of oscillations of a pendu- 
lum on its length LZ and its initial angle of displacement, measured from the radius it 
occupies at the lowest point of its trajectory. Formula (6.75) is an elementary version 
of formula (6.65) of the preceding section. 

A pendulum can be thought of, for example, as consisting of a weightless rod, 
one end of which is attached by a hinge while the other end, to which a point mass 
is attached, is free. 

In that case one can speak of arbitrary initial angles go € [0, 2]. For gg = 0 and 
go = 7, the pendulum will not oscillate at all, being in a state of stable equilibrium 
in the first case and unstable equilibrium in the second. 

It is interesting to note that (6.75) and (6.76) easily imply that T — oo as gp > 
x — 0, that is, the period of oscillation of a pendulum increases without bound as its 
initial position approaches the upper (unstable) equilibrium position. 


c. Conditional Convergence of an Improper Integral 


Definition 5 If an improper integral converges but not absolutely, we say that it 
converges conditionally. 
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Example 12 Using Remark 1, by the formula for integration by parts in an improper 


integral, we find that 
— + cos x + cos x 
5 dx = 5 dx, 
/2 m/2 x m/2 x 


TO sinx cos x 

dx = 
x/2 XxX XxX 

provided the last integral converges. But, as we saw in Example 5, this integral 

converges, and hence the integral 


+ sing 
dy (6.77) 
m/2 x 


also converges. 
At the same time, the integral (6.77) is not absolutely convergent. Indeed, for 
b € [m/2, +00[ we have 


b > sin? x 1° dx 1 f% cos2x 
de dy= dx. (6.78) 
x/2 n/2 x 2Inj2 X% 2Iqaj2 x 


The integral 
/ + cos 2x 
dx, 
m/2 x 


as can be verified through integration by parts, is convergent, so that as b > +00, 
the difference on the right-hand side of relation (6.78) tends to ++-oo. Thus, by esti- 
mate (6.78), the integral (6.77) is not absolutely convergent. 

We now give a special convergence test for improper integrals based on the sec- 
ond mean-value theorem and hence essentially on the same formula for integration 
by parts. 


sin. x 


x 


Proposition 4 (Abel—Dirichlet test for convergence of an integral) Let x + f(x) 
and x +> g(x) be functions defined on an interval [a, w[ and integrable on every 
closed interval [a, b] C [a, w[. Suppose that g is monotonic. 

Then a sufficient condition for convergence of the improper integral 


/ (f - g)(x) dx (6.79) 


is that the one of the following pairs of conditions hold: 


a) the integral [. ty f (x) dx converges, 
B,) the function g bounded on [a, of, 


or 


a2) the function F(b) = ie f (x) dx is bounded on [a, of, 
B2) the function g(x) tends to zeroas x > w, x €[a, af. 
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Proof For any b, and b in [a, w[ we have, by the second mean-value theorem, 


bo & bo 
[ ¢-ama=eon | Forde +atb2) [ Flax) dx, 


where & is a point lying between b; and b2. Hence by the Cauchy convergence 
criterion (Proposition 2), we conclude that the integral (6.79) does indeed converge 
if either of the two pairs of conditions holds. 


6.5.3 Improper Integrals with More than One Singularity 


Up to now we have spoken only of improper integrals with one singularity caused 
either by the unboundedness of the function at one of the endpoints of the interval 
of integration or by an infinite limit of integration. In this subsection we shall show 
in what sense other possible variants of an improper integral can be taken. 

If both limits of integration are singularities of either of these two types, then by 
definition 


w2 G w2 
/ f(x) dx = pode [ f(x) dx, (6.80) 
@| @| Cc 
where c is an arbitrary point of the open interval Ja), w2[. 

It is assumed here that each of the improper integrals on the right-hand side of 
(6.80) converges. Otherwise we say that the integral on the left-hand side of (6.80) 
diverges. 

By Remark 2 and the additive property of the improper integral, the definition 
(6.80) is unambiguous in the sense that it is independent of the choice of the point 
c € Jay, wl. 


Example 13 
[ dx [ dx | 
1Vv1— x2 1V1—x? 0 V1l—x2 


: 0 : 1 . 1 
= arcsinx| + arcsin x |, = arcsinx| =T. 


Example 14 The integral 


+00, 
i. e* dx 
—CO 


is called the Euler—Poisson integral, and sometimes the Gaussian integral. It obvi- 
ously converges in the sense given above. It will be shown later that its value is ./7. 
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+O dy 


diverges, since for every a at least one of the two integrals 


Example 15 The integral 


diverges. 


Example 16 The integral 


converges if each of the integrals 


! sin x + sin x 
ne dx and dx 
x 1 


0 xe 
converges. The first of these integrals converges if a < 2, since 


sin x 1 
os 


xo xa 


as x — +0. The second integral converges if a > 0, as one can verify directly 
through an integration by parts similar to the one shown in Example 12, or by citing 
the Abel—Dirichlet test. Thus the original integral has a meaning for 0 <a <2. 

In the case when the integrand is not bounded in a neighborhood of one of the 
interior points w of the closed interval of integration [a, b], we set 


b o b 
/ f(x) dx =) flxyde + f f(x) dx, (6.81) 
a a @w 
requiring that both of the integrals on the right-hand side exist. 


Example 17 In the sense of the convention (6.81) 
[ dx 
—_=4 
-1 7 |x| 
1 dx 


Example 18 The integral f_, & is not defined. 

Besides (6.81), there is a second convention about computing the integral of a 
function that is unbounded in a neighborhood of an interior point w of a closed 
interval of integration. To be specific, we set 


b o—5 b 
pv f f(x)dx:= lim (/ fl)dr + | fis)dr), (6.82) 
a 6 +0 a w+6d 
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if the integral on the right-hand side exists. This limit is called, following Cauchy, 
the principal value of the integral, and, to distinguish the definitions (6.81) and 
(6.82), we put the letters PV in front of the second to indicate that it is the principal 
value. 

In accordance with this convention we have 


h dee 
PV —=0. 
-1 x 


We also adopt the following definition: 


Example 19 


+00 R 
PV / f(x)dx:= lim i f (x) dx. (6.83) 
—~oo R-+-+00 J_R 


Example 20 


+00 
pv [ xdx =0. 


—0o 
Finally, if there are several (finitely many) singularities of one kind or another on 
the interval of integration, at interior points or endpoints, then the nonsingular points 
of the interval are divided into a finite number of such intervals, each containing 
only one singularity, and the integral is computed as the sum of the integrals over 
the closed intervals of the partition. 
It can be verified that the result of such a computation is not affected by the 
arbitrariness in the choice of a partition. 


Example 21 The precise definition of the logarithmic integral can now be written 
as 
x dt 


: 0 inf? if0<x <1, 
lix= edt: 
PY fj pee eK 


In the last case the symbol PV refers to the only interior singularity on the interval 
]0, x[, which is located at 1. We remark that in the sense of the definition in formula 
(6.81) this integral is not convergent. 


6.5.4 Problems and Exercises 


1. Show that the following functions have the stated properties. 


a) SiG) = jf, sint dt (the sine integral) is defined on all of R, is an odd function, 
and has a limit as x > +00. 

b) si(x) = — f° *™ dr is defined on all of R and differs from Six only by a 
constant; 
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c) Cix = — f° 28! dr (the cosine integral) can be computed for sufficiently 


~ sinx. 


large values of x by the approximate formula Cix ~ ; estimate the region of 


values where the absolute error of this approximation is fee than 1074. 


2. Show that 


a) the integrals /, a sa dxf; *9° cOS® dy converge only for w > 0, and abso- 
lutely only for a > 1; 
b) the Fresnel integrals 


oe eee eo 
ca)= = | cos t* dt, se=— | sint* dt 


are infinitely differentiable functions on the interval ]0, +-oo[, and both have a limit 
aS x > +00. 


3. Show that 
a) the elliptic integral of first kind 


sing dt 
Fk, g)= 
0 Jd—-1)0 —kt?) 
is defined forO <k <1,0<@ <4 and can be brought into the form 


y dy 
F(k, o=| << 
9 /1—k sin? 


b) the complete elliptic integral of first kind 


Ki) = i 
e ji Bsn y wv 


increases without bound as k > 1 — 0. 
4. Show that 


a) the exponential integral Ei(x) = dows © dt i is defined and infinitely differen- 
tiable for x < 0; _ 

b) —Bi(-x) = SU - $+ 3 + CD" + 0(ge)) as x > +00; 

c) the series pee o(- 1 1’ does not converge for any value of x € R; 

d) li(x)~ +» asx > +0. (For the definition of the logarithmic integral li(x) 


Inx 


see Example 21.) 


5. Show that 


a) the function ®(x) = += {8 =i dt, called the error function and often de- 


noted erf(x), is defined, even, and infinitely differentiable on R and has a limit as 
x—> +00; 
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b) if the limit in a) is equal to | (and it is), then 


2 ay) 
erf(x) = = | e! dt= 
0 
1 2 _2f 1 1 Pee DS 5. 1 
I e 
J 2x 22x3  23x5 24x7 eu 


6. Prove the following statements. 


as x > +00. 


a) If a heavy particle slides under gravitational attraction along a curve given 
in parametric form as x = x(@), y = y(@), and at time t = 0 the particle had zero 
velocity and was located at the point x9 = x(@0), yo = y(0o), then the following 
relation holds between the parameter 6 defining a point on the curve and the time ¢ 
at which the particle passes this point (see formula (6.63) of Sect. 6.4) 


a [ / OP +O? ag 

Jey Y  280%0- yO)” 
in which the improper integral necessarily converges if y’(69) 4 0. (The ambiguous 
sign is chosen positive or negative according as ¢ and @ have the same kind of 
monotonicity or the opposite kind; that is, if an increasing @ corresponds to an 
increasing ¢, then one must obviously choose the positive sign.) 


b) The period of oscillation of a particle in a well having cross section in the 
shape of a cycloid 


x=R(0+7+sin60), 
|O| <x, 


y=—R(1+ cos), 


is independent of the level yp = —R(1 + cos 69) from which it begins to slide and is 
equal to 42. R/g (see Problem 4 of Sect. 6.4). 


Chapter 7 
Functions of Several Variables: Their Limits 
and Continuity 


Up to now we have considered almost exclusively numerical-valued functions x b> 
jf (x) in which the number f(x) was determined by giving a single number x from 
the domain of definition of the function. 

However, many quantities of interest depend on not just one, but many factors, 
and if the quantity itself and each of the factors that determine it can be char- 
acterized by some number, then this dependence reduces to the fact that a value 
y = f(«!,...,x”) of the quantity in question is made to correspond to an ordered 
set (x!, ..., x”) of numbers, each of which describes the state of the corresponding 
factor. The quantity assumes this value when the factors determining this quantity 
are in these states. 

For example, the area of a rectangle is the product of the lengths of its sides. The 
volume of a given quantity of gas is computed by the formula 


mT 
V=R—, 
P 

where R is a constant, m is the mass, T is the absolute temperature, and p is the 
pressure of the gas. Thus the value of V depends on a variable ordered triple of 
numbers (m, T, p), or, as we say, V is a function of the three variables m, T, and p. 

Our goal is to learn how to study functions of several variables just as we learned 
how to study functions of one variable. 

As in the case of functions of one variable, the study of functions of several 
numerical variables begins by describing their domains of definition. 


7.1 The Space R” and the Most Important Classes of Its Subsets 


7.1.1 The Set R™ and the Distance in It 


We make the convention that R” denotes the set of ordered m-tuples (x!,..., x”) 
of real numbers x’ € R (i = 1,...,m). 
© Springer-Verlag Berlin Heidelberg 2015 409 
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Each such m-tuple will be denoted by a single letter x = (x!,...,x”) and, in 
accordance with convenient geometric terminology, will be called a point of R”. 
The number x! in the set (x!,...,x’”) will be called the ith coordinate of the point 
x= (x!,..., x). 

The geometric analogies can be extended by introducing a distance on R” be- 
tween the points x; = Gis wey) and x2 = coe ...,5') according to the formula 


d(x1,x2)= | >o(x} — x)’. (7.1) 
i=l 


The function 
d:R” xR" >R 
defined by the formula (7.1) obviously has the following properties: 


a) d(x1, x2) = 0; 

b) (d(x1, x2) =0) & (1 = x2); 

c) d(x1, x2) = d(x2, x1); 

d) d(x1, x3) < d(x1, x2) + d(x2, x3). 


This last inequality (called, again because of geometric analogies, the triangle 
inequality) is a special case of Minkowski’s inequality (see Sect. 5.4.2). 

A function defined on pairs of points (x1, x2) of a set X and possessing the prop- 
erties a), b), c), and d) is called a metric or distance on X. 

A set X together with a fixed metric on it is called a metric space. 

Thus we have turned R” into a metric space by endowing it with the metric given 
by relation (7.1). 

The reader can get information on arbitrary metric spaces in Chap. 9 (Part 2). 
Here we do not wish to become distracted from the particular metric space R” that 
we need at the moment. 

Since the space R” with metric (7.1) will be our only metric space in this chapter, 
forming our object of study, we have no need for the general definition of a metric 
space at the moment. It is given only to explain the term “space” used in relation to 
IR” and the term “metric” in relation to the function (7.1). 

It follows from (7.1) that for i € {1,...,m} 


|x — x5] <d(x1, x2) < vm max |x} — x5], (7.2) 
that is, the distance between the points x;, x2 € R” is small if and only if the corre- 
sponding coordinates of these points are close together. 

It is clear from (7.2) and also from (7.1) that for m = 1, the set R! is the same 
as the set of real numbers, between whose points the distance is measured in the 
standard way by the absolute value of the difference of the numbers. 
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7.1.2 Open and Closed Sets in R™ 


Definition 1 For 6 > 0 the set 
B(a; 6) = {x € R” | d(a,x) < 5} 


is called the ball with center a € R” of radius 6 or the 6-neighborhood of the point 
aéR”. 


Definition 2 A set G C R” is open in R” if for every point x € G there is a ball 
B(x; 5) such that B(x; 5) C G. 


Example I RR” is an open set in R”. 


Example 2 The empty set @ contains no points at all and hence may be regarded as 
satisfying Definition 2, that is, @ is an open set in R”. 


Example 3 A ball B(a;r) is an open set in R”. Indeed, if x € B(a;r), that is, 
d(a, x) <r, then for 0 <6 <r—d(a,x), we have B(x; 5) C B(a;r), since 


(€ € Bx; 8)) > (d@, &) <8) > 
=> (d(a, €) < d(a,x) + d(x, &) <d(a,x) +r—d(a,x)=r). 


Example 4 A set G = {x € R” | d(a, x) >1r}, that is, the set of points whose dis- 
tance from a fixed point a € R” is larger than r, is open. This fact is easy to verify, 
as in Example 3, using the triangle inequality for the metric. 


Definition 3 The set F C R” is closed in R” if its complement G = R” \ F is open 
in R”. 


Example 5 The set B(azr) = {x € R” | d(a, x) <r}, r > 0, that is, the set of points 
whose distance from a fixed point a € R” is at most r, is closed, as follows from 
Definition 3 and Example 4. The set B(a;r) is called the closed ball with center a 
of radius r. 


Proposition 1 a) The union |),<4 Ga of the sets of any system {Gq, a € A} of open 
sets in R” is an open set in R”. 

b) The intersection ( \;_, G; of a finite number of open sets in R™ is an open set 
in R”. 

a’) The intersection ( |ye4 Fa of the sets of any system {Fy, a € A} of closed sets 
F, in R™ is a closed set in R”™. 

b’) The union \j_, Fi of a finite number of closed sets in R” is a closed set 
in R”. 
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Proof a) lf x € ee A Ga, then there exists a@ € A such that x € Gg, and conse- 
quently there is a d-neighborhood B(x; 5) of x such that B(x; 6) C Gg,. But then 
Bar) C| Jeg Ge: 

b) Let x € ‘ae G;. Then x € G; (i =1,...,7). Let 6,,..., 6, be positive num- 
bers such that B(x; 6;) C G; @ =1,...,n). Setting 6 = min{d1,...,5,}, we obvi- 
ously find that 5 > 0 and B(x; 5) C()j_, Gi. 

a’) Let us show that the set C((] Fa) complementary to () 
an open set in R”. 

Indeed, 


Sed acd fa in R” is 


(1 Fa) =| J@er=|_ J Gx 


acA acA acA 


where the sets Gy = C Fy are open in R™. Part a’) now follows from a). 
b’) Similarly, from b) we obtain 


e(U n) =(\(CF)=()Gi. 
i=1 i=l i=l 


Example 6 The set S(a;r) = {x € R” | d(a,x) =r}, r = 0, is called the sphere of 
radius r with center a € R”. The complement of S(a;r) in R”, by Examples 3 
and 4, is the union of open sets. Hence by the proposition just proved it is open, and 
the sphere S(a; 7) is closed in R”. 


Definition 4 An open set in R” containing a given point is called a neighborhood 
of that point in R’”. 

In particular, as follows from Example 3, the 5-neighborhood of a point is a 
neighborhood of it. 


Definition 5 In relation to a set E C R” a point x € R” is 


an interior point if some neighborhood of it is contained in E; 
an exterior point if it is an interior point of the complement of E in R”; 
a boundary point if it is neither an interior point nor an exterior point. 


It follows from this definition that the characteristic property of a boundary point 
of a set is that every neighborhood of it contains both points of the set and points 
not in the set. 


Example 7 The sphere S(a;r), r > 0 is the set of boundary points of both the open 
ball B(a; r) and the closed ball B(a; r). 


Example & A point a € R” is a boundary point of the set R’”\a, which has no 
exterior points. 


Example 9 All points of the sphere S(a; r) are boundary points of it; regarded as a 
subset of R”, the sphere S(a; r) has no interior points. 
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Definition 6 A point a € R” is a limit point of the set E C R” if for any neighbor- 
hood O(a) of a the intersection E 1 O(a) is an infinite set. 


Definition 7 The union of a set E and all its limit points in R” is the closure of E 
in R”. 


The closure of the set E is usually denoted E. 


Example 10 The set B(a;r) = B(a; r)US(a; r) is the set of limit points of the open 
ball B(a; r); that is why B(a;1r), in contrast to B(a; r), is called a closed ball. 


Example 11 S(a:r) = S(azr). 


Rather than proving this last equality, we shall prove the following useful propo- 
sition. 


Proposition 2 (F is closed in R”) & (F = F inR"). 
In other words, F is closed in R” if and only if it contains all its limit points. 


Proof Let F be closed in R”, x € R”, and x ¢ F. Then the open set G = R”\F 
is a neighborhood of x that contains no points of F. Thus we have shown that if 
x ¢ F, then x is not a limit point of F. 

Let F = F. We shall verify that the set G = R” \ F is open in R’”. If x € G, then 
x ¢ F, and therefore x is not a limit point of F. Hence there is a neighborhood of 
x containing only a finite number of points x1,...,x, of F. Since x ¢ F’, one can 
construct, for example, balls about x, Oi (x), ..., On(x) such that x; ¢ Oj(x). Then 
O(x) = (}_, Oj (x) is an open neighborhood of x containing no points of F at all, 
that is, O(x) C R™\F and hence the set R”\F = R”\F is open. Therefore F is 
closed in R”. 


7.1.3 Compact Sets in IR” 


Definition 8 A set K C R” is compact if from every covering of K by sets that are 
open in R” one can extract a finite covering. 


Example 12 A closed interval [a,b] C R! is compact by the finite covering lemma 
(Heine—Borel theorem). 


Example 13 A generalization to R” of the concept of a closed interval is the set 


f={xeR" |a' <x' <5', a Pee 4 
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which is called an m-dimensional interval, or an m-dimensional block or an m- 
dimensional parallelepiped. 
We shall show that J is compact in R”. 


Proof Assume that from some open covering of J one cannot extract a finite cov- 
ering. Bisecting each of the coordinate closed intervals J! = {x! € R: a! < x! < b'} 
(i =1,...,m), we break the interval J into 2” intervals, at least one of which 
does not admit a covering by a finite number of sets from the open system we 
started with. We proceed with this interval exactly as with the original inter- 
val. Continuing this division process, we obtain a sequence of nested intervals 
IT=hDhhD---DI,D---, none of which admits a finite covering. If J, = {x € 
R” | a <xi< ie i,...,m}, then for each i € {1, ...,m} the coordinate closed in- 
tervals ai, <xi< ws (n = 1,2,...) form, by construction, a system of nested closed 
intervals whose lengths tend to zero. By finding the point €! € [ai b',] common to 
all of these intervals for each i € {1,...,m}, we obtain a point € = (é!,..., &’”) 
belonging to all the intervals J = 1), In,..., In,.... Since & € J, there is an open 
set G in the system of covering sets such that € € G. Then for some 5 > 0 we also 
have B(é; 6) C G. But by construction and the relation (7.2) there exists N such that 
I, C B(é; 5) C G forn > N. We have now reached a contradiction with the fact that 
the intervals J, do not admit a finite covering by sets of the given system. 


Proposition 3 [f K is a compact set in R”, then 


a) K is closed in R"; 
b) any closed subset of IR” contained in K is itself compact. 


Proof a) We shall show that any point a € R” that is a limit point of K must belong 
to K. Suppose a ¢ K. For each point x € K we construct a neighborhood G(x) 
such that a has a neighborhood disjoint from G(x). The set {G(x)}, x € K, consist- 
ing of all such neighborhoods forms an open covering of the compact set K, from 
which we can select a finite covering G(x1),..., G(x»). If now O;(a) is a neigh- 
borhood of a such that G(x;) N O; (a) = @, then the set O(a) = (Wea Oj (a) is also 
a neighborhood of a, and obviously K M O(a) = ©. Thus a cannot be a limit point 
of K. 

b) Suppose F is a closed subset of R” and F Cc K. Let {Gy}, a € A, be a cov- 
ering of F by sets that are open in R”. Adjoining to this collection the open set 
G = R” \F, we obtain an open covering of R”, and in particular, an open covering 
of K, from which we select a finite covering of K. This finite covering of K will 
also cover the set F. Observing that GM F = ©, one can say that if G belongs to 
this finite covering, we will still have a finite covering of F by sets of the original 
system {Gy}, a € A, if we remove G. 


Definition 9 The diameter of a set E C R” is the quantity 


d(E):= sup d(xj,%2). 


X1,X2E€E 
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Definition 10 A set E C R” is bounded if its diameter is finite. 
Proposition 4 [f K is a compact set in R”, then K is a bounded subset of R’’. 


Proof Take an arbitrary point a € R” and consider the sequence of open balls 
{B(a;n)} (n=1,2,...). They form an open covering of R” and consequently also 
of K.If K were not bounded, it would be impossible to select a finite covering of 
K from this sequence. 


Proposition 5 The set K C R” is compact if and only if K is closed and bounded 
in R”. 


Proof The necessity of these conditions was proved in Propositions 3 and 4. 

Let us verify that the conditions are sufficient. Since K is a bounded set, there 
exists an m-dimensional interval J containing K. As was shown in Example 13, 
I is compact in R”. But if K is a closed set contained in the compact set /, then by 
Proposition 3b) it is itself compact. 


7.1.4 Problems and Exercises 


1. The distance d(E,, Ez) between the sets E,, Ez C R” is the quantity 


d(E£\, Eo) := inf d(x, X2). 


x1, EE) ,x2EE2 


Give an example of closed sets E; and E2 in R” having no points in common for 
which d(E}, Ez) =0. 
2. Show that 


a) the closure E in R” of any set E C R” is a closed set in R’”; 
b) the set dE of boundary points of any set E C R” is a closed set; 
c) if G is an open set in R” and F is closed in R”, then G\F is open in R”. 


3. Show that if Kj D Ko D---D Ky, D--- is a sequence of nested nonempty com- 
pact sets, then ()7°, Ki 4 2. 

4, a) In the space R* a two-dimensional sphere S* and a circle S! are situated so 
that the distance from any point of the sphere to any point of the circle is the same. 
Is this possible? 


b) Consider problem a) for spheres §”, S$” of arbitrary dimension in R*. Under 
what relation on m, n, and k is this situation possible? 
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7.2 Limits and Continuity of Functions of Several Variables 


7.2.1 The Limit of a Function 


In Chap. 3 we studied in detail the operation of passing to the limit for a real-valued 
function f : X — R defined on a set in which a base B was fixed. 

In the next few sections we shall be considering functions f : X — R” defined 
on subsets of IR” with values in R = R! or more generally in R”, n € N. We shall 
now make a number of additions to the theory of limits connected with the specifics 
of this class of functions. 

However, we begin with the basic general definition. 


Definition 1 A point A € R” is the limit of the mapping f : X > R" over a base B 
in X if for every neighborhood V (A) of the point there exists an element B € B of 
the base whose image f(B) is contained in V (A). 


In brief, 
(lim f(@) = A) = (VV (A) IBeB (f(B) Cc V(A))). 


We see that the definition of the limit of a function f : X — R” is exactly the 
same as the definition of the limit of a function f : X — R if we keep in mind what 
a neighborhood V (A) of a point A € R” is for every n EN. 


Definition 2 A mapping f : X — R” is bounded if the set f(X) C R” is bounded 
in R”. 


Definition 3 Let B be a base in X. A mapping f : X > R” is ultimately bounded 
over the base B if there exists an element B of B on which f is bounded. 


Taking these definitions into account and using the same reasoning that we gave 
in Chap. 3, one can verify without difficulty that 


a function f : X — R" can have at most one limit over a given base B in X; 
a function f : X — R" having a limit over a base B is ultimately bounded over 
that base. 


Definition | can be rewritten in another form making explicit use of the metric 
in R”, namely 


Definition 1’ 
(lim f@) =Ae R") := (Ve > 04B € Bx € B (d(f (x), A) <e)) 


or 
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Definition 1” 
(lim f(x)=Ae R") = (lim(a( oo. A)) = 0). 


The specific property of a mapping f : X — R” is that, since a point y € R” is 
an ordered n-tuple (y!,..., y”) of real numbers, defining a function f : X > R” 
is equivalent to defining n real-valued functions f' : X > R (i = 1,...,n), where 
fiay=y! G@=1,...,n). 

If A=(A!,..., A”) and y =(y!,..., y”), we have the inequalities 


|v — A'| <d(y, A) < vin max |y' — A'J, (7.3) 
l<i<n 
from which one can see that 
lim f(x) = A @ lim f'(x) = A! (i=1,...,n), (7.4) 


that is, convergence in R” is coordinatewise. 
Now let X = N be the set of natural numbers and 6 the base k > o0, k EN, 
in X. A function f : N > R” in this case is a sequence {yz}, k € N, of points of R”. 


Definition 4 A sequence {yx}, k € N, of points yg € R” is fundamental (a Cauchy 
sequence) if for every ¢ > 0 there exists a number N € N such that d(yx,, yx.) < € 
for all kj,kg > N. 

One can conclude from the inequalities (7.3) that a sequence of points y; = 
(y}, ..+, ¥p) € R" is a Cauchy sequence if and only if each sequence of coordinates 
having the same labels {yi}, keN,i=1,...,n, is a Cauchy sequence. 

Taking account of relation (7.4) and the Cauchy criterion for numerical se- 
quences, one can now assert that a sequence of points IR” converges if and only 
if it is a Cauchy sequence. 

In other words, the Cauchy criterion is also valid in R”. 

Later on we shall call metric spaces in which every Cauchy sequence has a limit 
complete metric spaces. Thus we have now established that R” is a complete metric 
space for every n EN. 


Definition 5 The oscillation of a function f : X > R” onaset E C X is the quan- 
tity 

w(f; E):=d(f(E)), 
where d(f (E)) is the diameter of f(E). 


As one can see, this is a direct generalization of the definition of the oscillation 
of a real-valued function, which Definition 5 becomes when n = 1. 

The validity of the following Cauchy criterion for the existence of a limit for 
functions f : X — R” with values in R” results from the completeness of R”. 
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Theorem 1 Let X be a set and B a base in X. A function f : X — R" has a limit 
over the base B if and only if for every ¢ > 0 there exists an element B € B of the 
base on which the oscillation of the function is less than e. 

Thus, 


Alim f(x) @ Ve > 04B € B(o(f; B) <e). 


The proof of Theorem | is a verbatim repetition of the proof of the Cauchy crite- 
rion for numerical functions (Theorem 4 in Sect. 3.2), except for one minor change: 
| f (x1) — f (x2)| must be replaced throughout by d(f (x1), f(x2)). 

One can also verify Theorem | another way, regarding the Cauchy criterion as 
known for real-valued functions and using relations (7.4) and (7.3). 

The important theorem on the limit of a composite function also remains valid 
for functions with values in R”. 


Theorem 2 Let Y be a set, By a base in Y, and g : Y > R" a mapping having a 
limit over the base By. 

Let X be a set, Bx a base in X, and f : X — Y amapping of X into Y such that 
for each By € By there exists By € Bx such that the image f (Bx) is contained 
in By. 

Under these conditions the composition g o f : X — R" of the mappings f and 
g is defined and has a limit over the base Bx, and 


lim(g 0 f )(x) = lim g(9). 


The proof of Theorem 2 can be carried out either by repeating the proof of Theo- 
rem 5 of Sect. 3.2, replacing R by R”, or by invoking that theorem and using relation 
(7.4). 

Up to now we have considered functions f : X — R” with values in R”, without 
specifying their domains of definition X in any way. From now on we shall primarily 
be interested in the case when X is a subset of R”. 

As before, we make the following conventions. 


U(a) is a neighborhood of the point a € R”; 

U (a) is a deleted neighborhood of a € R’”, that is, U(a) := U(a)\a; 

Uz(a) is a neighborhood of a in the set E C R”, that is, Ug(a) := ENU(a); 

Up (a) is a deleted neighborhood of a in E, that is, Ug (a) := EN U(a); 

x — ais the base of deleted neighborhoods of a in R”; 

x — ov is the base of neighborhoods of infinity, that is, the base consisting of the 
sets R”\ B(a;r); 

x —>a,x € E or (E > x — a) is the base of deleted neighborhoods of a in E if 
a is a limit point of EF; 

x > oc, x € E or (E 5 x > ov) is the base of neighborhoods of infinity in E 
consisting of the sets E\ B(a; r), if E is an unbounded set. 
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In accordance with these definitions, one can, for example, give the following 
specific form of Definition | for the limit of a function when speaking of a function 
f :£— R” mapping a set E C R” into R”: 


(lim f(x) =A) = (Ve > 030 2(@) Vx € g(a) (d(f(@), A) <2). 


E>x—>a 


The same thing can be written another way: 
( lim f= A) = 
E>x—>a 
= (Ve > 055 > OVx € E (0 <d(x,a) <5 >d(f(x), A) <8)). 
Here it is understood that the distances d(x, a) and d(f (x), A) are measured in the 


spaces (R™ and R”) in which these points lie. 
Finally, 


(Jim. f(x)= A) := (Ve > 0 AB(a;r) Vx € R”\ Ba; r) (d(f (x), A) <e)). 


Let us also agree to say that, in the case of a mapping f : X — R”, the phrase 
““f (x) = oo in the base 6” means that for any ball B(A; 7) C R” there exists B € B 
of the base G such that f(B) C R”\B(A;r). 


Example 1 Let x +> m'(x) be the mapping z! : R” — R assigning to each x = 
(x!,..., x") in R™ its ith coordinate x‘. Thus 
m'i(x) =x!. 
Ifa =(a!,..., a’), then obviously 
m'(x) —+@ ae u. 


The function x ++ 2/(x) does not tend to any finite value nor to infinity as 
x>oifm>l. 
On the other hand, 


m 


f= S (zit)? >0o asx oo. 


i=l 


One should not think that the limit of a function of several variables can be found 
by computing successively the limits with respect to each of its coordinates. The 
following examples show why this is not the case. 


Example 2 Let the function f : R* — R be defined at the point (x, y) € R? as fol- 
lows: 
ates Zay ifa? ty? #0, 
x, y= 
0, if x7 + y*=0. 
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Then f (0, y) = f(x, 0) =0, while f(x, x)= 5 for x £0. 
Hence this function has no limit as (x, y) > (0, 0). 


On the other hand, 
eee) eR Ore 
Ce) ae 


Example 3 For the function 


Ji 29, : 
AG ena UO ES AO 
0, if x? + y? =0, 


we have 


x2 
Jim (im, fe, »)= lees aa 


2 
ee 


Example 4 For the function 


rica oe ada if x £0, 


if x =0, 
we have 
XX, y= 
(x, pate 0) fay 
lim (tim, f(x, re 
x0 


yet at the same time the iterated limit 


lim (lim f(x, v)) 


y>0\x>0 


does not exist at all. 


Example 5 The function 
2 . 
jena | aoe He 40 
0, if x7 + y* =0, 


has a limit of zero upon approach to the origin along any ray x = at, y = ft. 
At the same time, the function equals 5 at any point of the form (a, a*), where 
a # 0, and so the function has no limit as (x, y) > (0, 0). 


7.2 Limits and Continuity of Functions of Several Variables 421 


7.2.2 Continuity of a Function of Several Variables and Properties 
of Continuous Functions 


Let E be a subset of R” and f : E > R” a function defined on E with values in R”. 


Definition 6 The function f : E > R” is continuous at a € E if for every neigh- 
borhood V(f(a)) of the value f(a) that the function assumes at a, there exists a 
neighborhood U;(a) of a in E whose image f(Ug(a)) is contained in V(f (a)). 


Thus 
(f : E > R" is continuous at a € E) — 
= (WV (f(@)) Ue (a) (f(Ue(@)) c V(F@))). 


We see that Definition 6 has the same form as Definition | for continuity of a 
real-valued function, which we are familiar with from Sect. 4.1. As was the case 
there, we can give the following alternate expression for this definition: 


(f : E > R" is continuous at a € E) = 


= (Ve > 055 > Ox € E (d(x, a) <5 > d(f(x), f(a) <8)), 


or, if a is a limit point of E, 
(f : E> R" is continuous at a € E) := ( lim f@)= f(a)). 
E>x—a 


As noted in Chap. 4, the concept of continuity is of interest precisely in connec- 
tion with a point a € E that is a limit point of the set E on which the function f is 
defined. 

It follows from Definition 6 and relation (7.4) that the mapping f : E > R” 
defined by the relation 


ee x" axrSya(yl, = 
Se skeen ined 1 wise); 
is continuous at a point if and only if each of the functions y’ = f‘(x!,...,x’”) is 


continuous at that point. 
In particular, we recall that we defined a path in R” to be a mapping f : J > R” 
of an interval J C R defined by continuous functions f!(x),..., f”(x) in the form 


0S 9= 6 IIS Oycf"@). 


Thus we can now say that a path in R” is a continuous mapping of an interval 
I CR of the real line into R”. 
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By analogy with the definition of oscillation of a real-valued function at a point, 
we introduce the concept of oscillation at a point for a function with values in R”. 
Let E be a subset of R”, a € E, and Bge(a;r) = EN B(a;r). 


Definition 7 The oscillation of the function f : E > R’ at the point a € E is the 
quantity 


o(fia):= jim o(f: Bge(a;r)). 


From Definition 6 of continuity of a function, taking account of the properties of 
a limit and the Cauchy criterion, we obtain a set of frequently used local properties 
of continuous functions. We now list them. 


Local Properties of Continuous Functions 


a) Amapping f : E > R” ofaset E C R” is continuous at a point a € E if and 
only if w(f;a) =0. 

b) A mapping f : E — R" that is continuous ata € E is bounded in some neigh- 
borhood Ug, (a) of that point. 

c) If the mapping g : Y — R* of the set Y C R" is continuous at a point yo € Y 
and the mapping f : X — Y of the set X CR” is continuous at a point xo € X and 
f (xo) = yo, then the mapping go f : X > R* is defined, and it is continuous at 
x9 EX. 

Real-valued functions possess, in addition, the following properties. 

d) If the function f : E — R is continuous at the point a € E and f(a) > 0 (or 
f(a) <0), there exists a neighborhood Ur(a) of a in E such that f (x) > 0 (resp. 
f(x) <0) for all x € Ug(a). 

e) If the functions f : E > R and g: E — R are continuous at a € E, then 
any linear combination of them (af + Bg): E > R, where a, B € R, their product 
(f-g):E—R, and, if g(x) £0 on E, their quotient (£) : E > R are defined on 
E and continuous at a. 


Let us agree to say that the function f : E > R” is continuous on the set E if it 
is continuous at each point of the set. 

The set of functions f : E — R” that are continuous on E will be denoted 
C(E, R”) or simply C(£), if the range of values of the functions is unambiguously 
determined from the context. As a rule, this abbreviation will be used when R” = R. 


Example 6 The functions (xt, oe.) > xi (i = 1,...,m), mapping R” onto R 
(projections) are obviously continuous at each point a = (a!,..., a’) € R”, since 
limy +g 7!(x) =ai =n‘ (a). 

Example 7 Any function x + f(x) defined on R, for example x +> sinx, can also 


be regarded as a function (x, y) ee, f (x) defined, say, on R?. In that case, if f was 
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continuous as a function on R, then the new function (x, y) *, Ff (x) will be con- 
tinuous as a function on R?. This can be verified either directly from the definition 
of continuity or by remarking that the function F is the composition (f o2!)(x, y) 
of continuous functions. 

In particular, it follows from this, when we take account of c) and e), that the 
functions 


f(x,y) =sinx +e”, f (x, y) = arctan(In(|x| + ly] + 1)), 


for example, are continuous on R?. 

We remark that the reasoning just used is essentially local, and the fact that the 
functions f and F studied in Example 7 were defined on the entire real line R or 
the plane R? respectively was purely an accidental circumstance. 


Example 8 The function f(x, y) of Example 2 is continuous at any point of the 
space IR? except (0,0). We remark that, despite the discontinuity of f(x, y) at this 
point, the function is continuous in either of its two variables for each fixed value of 
the other variable. 


Example 9 If a function f : E > R” is continuous on the set E and E is a subset 
of E, then the restriction f|; of f to this subset is continuous on E, as follows 
immediately from the definition of continuity of a function at a point. 


We now turn to the global properties of continuous functions. To state them for 
functions f : E > R", we first give two definitions. 


Definition 8 A mapping f : E — R” of aset E C R” into R” is uniformly contin- 
uous on E if for every ¢ > 0 there is a number 6 > 0 such that d(f (x1), f(x2)) <e 
for any points x;, x2 € E such that d(x1, x2) <6. 


As before, the distances d(x1, x2) and d(f (x1), f(x2)) are assumed to be mea- 
sured in R” and R” respectively. 

When m =n = 1, this definition is the definition of uniform continuity of 
numerical-valued functions that we already know. 


Definition 9 A set E C R” is pathwise connected if for any pair of its points xo, 
x1, there exists a path I”: J > E with support in E and endpoints at these points. 


In other words, it is possible to go from any point x9 € E to any other point 
x, € E without leaving EF. 

Since we shall not be considering any other concept of connectedness for a set 
except pathwise connectedness for the time being, for the sake of brevity we shall 
temporarily call pathwise connected sets simply connected. 


Definition 10 A domain in R” is an open connected set. 
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Example 10 An open ball B(a;r), r > 0, in R” is a domain. We already know that 
B(a;r) is open in R”. Let us verify that the ball is connected. Let x9 = (x4 wig) 
and x; = (x i: ...,X]") be two points of the ball. The path defined by the functions 
x!(t)= txt + (1 —1)x) G@ =1,...,m), defined on the closed interval 0 <r < 1, has 
Xo and x; as its endpoints. In addition, its support lies in the ball B(a; r), since, by 
Minkowski’s inequality, for any t € [0, 1], 


d(x(t),a)= | So (x' al) = | V((x} a’) + — (x) - a’) < 
\ i=! \ i=l 


m m 


(1 —H(x)-4'))" = 


e 


Example 11 The circle (one-dimensional sphere) of radius r > 0 is the subset of R? 
given by the equation (x!)* + (x*)* =r’. Setting x! =rcost, x7 =rsint, we see 
that any two points of the circle can be joined by a path that goes along the circle. 
Hence a circle is a connected set. However, this set is not a domain in R?, since it is 
not open in R?. 


We now state the basic facts about continuous functions in the large. 


Global Properties of Continuous Functions 


a) Ifa mapping f : K — R" is continuous on a compact set K CR", then it is 
uniformly continuous on K. 

b) Ifa mapping f : K — R" is continuous on a compact set K CR", then it is 
bounded on K. 

c) If a function f : K — R is continuous on a compact set K C R"”, then it 
assumes its maximal and minimal values at some points of K. 

d) Ifa function f : E — R is continuous on a connected set E and assumes the 
values f(a) =A and f(b) = B at points a,b € E, then for any C between A and 
B, there is a point c € E at which f(c)=C. 


Earlier (Sect. 4.2), when we were studying the local and global properties of 
functions of one variable, we gave proofs of these properties that extend to the more 
general case considered here. The only change that must be made in the earlier 
proofs is that expressions of the type |x; — x2| or | f (x1) — f (%2)| must be replaced 
by d(x1,x2) and d(f (x1), f(x2)), where d is the metric in the space where the 
points in question are located. This remark applies fully to everything except the 
last statement d), whose proof we now give. 
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Proof ad) Let I.: I > E be a path that is a continuous mapping of an interval 
[a, 6] =1C R such that (a) =a, I'(f) = b. By the connectedness of E there 
exists such a path. The function f o I”: J > R, being the composition of contin- 
uous functions, is continuous; therefore there is a point y € [a, 6] on the closed 
interval [a, 6] at which fol'(y) =C. Setc=TI(y). Thenc € E and f(c) =C. 


Example 12 The sphere S(0; r) defined in R” by the equation 
Gy roar (x)? 7 r2, 


is a compact set. 
Indeed, it follows from the continuity of the function 


(x',...,2") Ge) cet (x)? 


that the sphere is closed, and from the fact that |x! | <r(i=1,...,m) on the sphere 
that it is bounded. 
The function 


Ce, : are) = (cy eg ae (x*)? _ (xkt1)? ae (x)? 


is continuous on all of IR”, so that its restriction to the sphere is also continuous, 
and by the global property c) of continuous functions assumes its minimal and max- 
imal values on the sphere. At the points (1,0,...,0) and (0,...,0, 1) this function 
assumes the values | and —1 respectively. By the connectedness of the sphere (see 
Problem 3 at the end of this section), global property d) of continuous functions now 
enables us to assert that there is a point on the sphere where this function assumes 
the value 0. 


Example 13 The open set R”\S(0;r) for r > 0 is not a domain, since it is not 
connected. 

Indeed, if I’ : J — R” is a path one end of which is at the point x9 = (0,..., 0) 
and the other at some point x; = Gas ...,X7") such that Gd)? peer tf Gaye > Pr, 
then the composition of the continuous functions : 7 > R” and f : R” > R, 
where 


(x) - xe) S (xh)? feet a"), 


is a continuous function on J assuming values less than r? at one endpoint and 
greater than r? at the other. Hence there is a point y on J at which (fo )(y) =r’. 
Then the point x, = I’(y) in the support of the path turns out to lie on the sphere 
S(0; 7). We have thus shown that it is impossible to get out of the ball B(O; r) C R” 
without intersecting its boundary sphere S(0; 7). 
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7.2.3 Problems and Exercises 


1. Let f € C(R”; R). Show that 


a) the set EF} = {x € R” | f(x) <c} is open in R”; 

b) the set Ep = {x € R” | f(x) <c} is closed in R”; 

c) the set £3 = {x € R” | f(x) =c} is closed in R”; 

d) if f(x) > +00 as x > ov, then E2 and F3 are compact in R”; 

e) for any f :R” — R the set Ey = {x € R” | w(f; x) => &€} is closed in R”. 


2. Show that the mapping f : R” — R” is continuous if and only if the preimage 
of every open set in R” is an open set in R”. 
3. Show that 


a) the image f(£) of a connected set E Cc R” under a continuous mapping 
f : E > R’ is aconnected set; 

b) the union of connected sets having a point in common is a connected set; 

c) the hemisphere (x!)* + ---+ («’”)* = 1, x” > 0, is a connected set; 

d) the sphere (x!)2 +.---+ (x)? = 1 is a connected set; 

e) if E CR and E is connected, then E is an interval in R (that is, a closed 
interval, a half-open interval, an open interval, or the entire real line); 

f) if xo is an interior point and x; an exterior point in relation to the set M Cc R”, 
then the support of any path with endpoints xo, x; intersects the boundary of the 
set M. 


Chapter 8 
The Differential Calculus of Functions of Several 
Variables 


8.1 The Linear Structure on R” 


8.1.1 IR” as a Vector Space 


The concept of a vector space is already familiar to you from your study of algebra. 


If we introduce the operation of addition of elements x; = Gets eats xi) and x2 = 
Ge, ...,x5') in R” by the formula 
_ 1 1 m m 
xy tx2 = (xj $.29,...,47 + x3'), (8.1) 
and multiplication of an element x = (x!,..., x”) by a number A € R via the rela- 
tion 
DESO ede); (8.2) 


then R’” becomes a vector space over the field of real numbers. Its points can now 
be called vectors. 
The vectors 


B26 ).2:60:41,6,...0) GSlLca30 (8.3) 


(where the | stands only in the ith place) form a maximal linearly independent set 
of vectors in this space, as a result of which it turns out to be an m-dimensional 
vector space. 

Any vector x € R” can be expanded with respect to the basis (8.3), that is, rep- 
resented in the form 


x=xleyt--- +x em. (8.4) 


When vectors are indexed, we shall write the index as a subscript, while denoting 
its coordinates, as we have been doing, by superscripts. This is convenient for many 
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reasons, one of which is that, following Einstein,! we can make the convention of 
writing expressions like (8.4) briefly in the form 


x=x'e;, (8.5) 


taking the simultaneous presence of subscript and superscript with the same letter 
to indicate summation with respect to that letter over its range of variation. 


8.1.2 Linear Transformations L :IR™ — IR" 


We recall that a mapping L : X — Y from a vector space X into a vector space Y is 
called linear if 
LAX] +A2x2) = AL (x1) + AQL (x2) 
for any x1,x2 € X, and A1;,A2 € R. We shall be interested in linear mappings L : 
R” > R’. 
If {e1,...,@m} and {é1,..., @,} are fixed bases of R” and R” respectively, then, 
knowing the expansion 


L(e;)) =a}é) +++ +.até, =alé; (Gi =1,...,m) (8.6) 


of the images of the basis vectors under the linear mapping L : R” — R”, we can 
use the linearity of L to find the expansion of the image L(h) of any vector h = 
hie, +--+ +h™em = hie; in the basis {é1,...,&,}. To be specific, 


L(h) = L(h'e;) =h'L(e) = hiajé; = aj h'é;. (8.7) 
Hence, in coordinate notation: 
L(h) = (a}h',..., ath’), (8.8) 
For a fixed basis in R” the mapping L : R” — R” can thus be regarded as a set 
bE oul") (8.9) 


of n (coordinate) mappings L/ : R” > R. 
Taking account of (8.8), we easily conclude that a mapping L : R” > R" is 
linear if and only if each mapping L/ in the set (8.9) is linear. 
If we write (8.9) as a column, taking account of relation (8.8), we have 
L|(h) at +++ ay\ (hh! 
Lith) = : en : |. (8.10) 
L"(h) a h™ 


m 


‘A. Einstein (1879-1955) — greatest physicist of the twentieth century. His work in quantum theory 
and especially in the theory of relativity exerted a revolutionary influence on all of modern physics. 
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Thus, fixing bases in R” and R” enables us to establish a one-to-one corre- 
spondence between linear transformations L : R” — R” and m x n-matrices (a! ) 
(i =1,...,m, j =1,...,n). When this is done, the ith column of the matrix (a!) 
corresponding to the transformation L consists of the coordinates of L(e;), the im- 
age of the vector e; € {e1,..., em}. The coordinates of the image of an arbitrary 
vector h = h‘e; € R” can be obtained from (8.10) by multiplying the matrix of the 
linear transformation by the column of coordinates of h. 

Since R” has the structure of a vector space, one can speak of linear combinations 
Ai fi + A2f2 of mappings f; : X > R” and f2: X > R", setting 


(Ar fi + A2f2)(x) = A1 fi(®) + A2 f2(*). (8.11) 


In particular, a linear combination of linear transformations L,; : R’ — R® and 
L2:R” — R" is, according to the definition (8.11), a mapping 


hry AL (h) + A2L2(A) = L(A), 


which is obviously linear. The matrix of this transformation is the corresponding 
linear combination of the matrices of the transformations L; and L2. 

The composition C = B o A of linear transformations A: R” — R” and B: 
R"’ > R* is obviously also a linear transformation, whose matrix, as follows from 
(8.10), is the product of the matrix of A and the matrix of B (which is multiplied on 
the left). Actually, the law of multiplication for matrices was defined in the way you 
are familiar with precisely so that the product of matrices would correspond to the 
composition of the transformations. 


8.1.3 The Norm in R™ 


The quantity 


Ill = (4)? + + Gen)? 8.12) 


is called the norm of the vector x = (x!,...,x’") €R”. 
It follows from this definition, taking account of Minkowski’s inequality, that 


1° |x|] =0, 

2° (\|x|| =0) > (x =0), 

3° ||Ax|| =A] - llx|l, where A eR, 
Ae |lxq + x2] < [eri] + [lxell. 


Oo 


In general, any function || || : ¥ — R on a vector space X satisfying condi- 
tions 1°-4° is called a norm on the vector space. Sometimes, to be precise as to 
which norm is being discussed, the norm sign has a symbol attached to it to denote 
the space in which it is being considered. For example, we may write ||x||p” or 
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lly lle. As a rule, however, we shall not do that, since it will always be clear from 
the context which space and which norm are meant. 
We remark that by (8.12) 


x2 — xy || = d (x1, x2), (8.13) 


where d(x1, x2) is the distance in R” between the vectors x; and x2, regarded as 
points of R”. 
It is clear from (8.13) that the following conditions are equivalent: 


x —> XQ, d(x, x0) > 0, |x — xo|| > 0. 
In view of (8.13), we have, in particular, 
I|x|| = d(0, x). 


Property 4° of a norm is called the triangle inequality, and it is now clear why. 
The triangle inequality extends by induction to the sum of any finite number of 
terms. To be specific, the following inequality holds: 


[ler + ++ + xxl] S [oall +--+ + Weel. 


The presence of the norm of a vector enables us to compare the size of values of 
functions f : X > R” and g: X > R". 

Let us agree to write f(x) = o(g(x)) or f = o0(g) over a base B in X if 
Il f@) len = o(||g@) |p") over the base B. 

If f~=(f lox), ..., ft (x)) is the coordinate representation of the mapping 
f :X — R”, then in view of the inequalities 


If @|< | FO] <>] @| (8.14) 
i=l 


one can make the following observation, which will be useful below: 
(f =0(g) over the base B) > ‘Cu = 0(g) over the base B; i =1,...,m). (8.15) 


We also make the convention that the statement f = O(g) over the base B in X 
will mean that || f(x) ||Rp~ = O(||g(x) ||) over the base B. 
We then obtain from (8.14) 


(f = O(g) over the base B) cs (f' = O(g) over the base B; i=1,. ..,m). 
(8.16) 


Example Consider a linear transformation L : R” — R”. Leth =h'e; +--+ 
h™ em be an arbitrary vector in R”. Let us estimate || L(/)||Rn: 


SPU AC)) = SJeeal}| = (Select | (8.17) 
i=l i=l i=l 


|Z0| = 
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Thus one can assert that 


Lih)=O(h) ash 0. (8.18) 


In particular, it follows from this that L(x — x9) = L(x) — L(xo) > Oas x > x0, 
that is, a linear transformation L : R’” — R” is continuous at every point x9 € R”. 
From estimate (8.17) it is even clear that a linear transformation is uniformly con- 
tinuous. 


8.1.4 The Euclidean Structure on R™ 


The concept of the inner product in a real vector space is known from algebra as 
a numerical function (x, y) defined on pairs of vectors of the space and possessing 
the properties 


(x,x) 20, 
(x,x)=0¢6x=0, 
(x1, *2) = (x2,%1), 
(Ax1,%2) =A(x1,x2), WwhererA ER, 
(x1 + x2, %3) = (x1, %3) + (x2, X3). 


It follows in particular from these properties that if a basis {e1,..., @m} is fixed 
in the space, then the inner product (x, y) of two vectors x and y can be expressed 
in terms of their coordinates (x!,...,x’”) and (y!,..., y”) as the bilinear form 


(x,y) = gijx'y/ (8.19) 


(where summation over i and j is understood), in which g;; = (e;, €;). 
Vectors are said to be orthogonal if their inner product equals 0. 


A basis {e1, ..-, €m} is orthonormal if g;; = 6;;, where 
0, ifiFs, 
bij = Seat 
1, ifi=j. 


In an orthonormal basis the inner product (8.19) has the very simple form 
(x, y) = d:jx!y/, 
or 
py ag yl beeetia ey”, (8.20) 


Coordinates in which the inner product has this form are called Cartesian coor- 
dinates. 
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We recall that the space R” with an inner product defined in it is called Euclidean 
space. 

Between the inner product (8.20) and the norm of a vector (8.12) there is an 
obvious connection 


(x, x) = fla’. 
The following inequality is known from algebra: 
(x,y)? < (x, x){y, y). 


It shows in particular that for any pair of vectors there is an angle g € [0, x] such 
that 


(x, y) = |lIIIlyll cos @. 


This angle is called the angle between the vectors x and y. That is the reason we 
regard vectors whose inner product is zero as orthogonal. 

We shall also find useful the following simple, but very important fact, known 
from algebra: 

any linear function L : R” — R in Euclidean space has the form 


L(x) = (, x), 


where € € R” is a fixed vector determined uniquely by the function L. 


8.2. The Differential of a Function of Several Variables 
8.2.1 Differentiability and the Differential of a Function at a Point 


Definition 1 A function f : E — R” defined on a set E C R” is differentiable at 
the point x € E, which is a limit point of E, if 


fxth)— f@~)=L@A+a(xzh), (8.21) 


where L(x) : R”™ — R" is a function” that is linear in h and a(x; h) = o(h) as 
h>O,x+heeE. 
The vectors 


Ax(h) := (x +h) —x=hA, 
Af (x; h):= f(x +h) — f(x) 


By analogy with the one-dimensional case, we allow ourselves to write L(x)h instead of L(x)(h). 
We note also that in the definition we are assuming that R” and R” are endowed with the norm of 
Sect. 8.1. 
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are called respectively the increment of the argument and the increment of the func- 
tion (corresponding to this increment of the argument). These vectors are tradition- 
ally denoted by the symbols of the functions of h themselves Ax and Af (x). The 
linear function L(x) : R” — R” in (8.21) is called the differential, tangent mapping, 
or derivative mapping of the function f : E > R” at the point x € E. 

The differential of the function f : E — R” at a point x € E is denoted by the 
symbols df (x), Df (x), or f’(x). 

In accordance with the notation just introduced, we can rewrite relation (8.21) as 


fx +h) — fx) = f'@)h + ax; h) 
or 


Af (x;h) =df(x)h + a(x; h). 


We remark that the differential is defined on the displacements h from the point 
xeR”. 

To emphasize this, we attach a copy of the vector space R” to the point x € R” 
and denote it 7,..R”, TIR” (x), or TR”. The space TR” can be interpreted as a set 
of vectors attached at the point x ¢ R”. The vector space TR” is called the tangent 
space to R” at x € R”. The origin of this terminology will be explained below. 

The value of the differential on a vector h € TR” is the vector f’(x)h € TR’, te 
attached to the point f(x) and approximating the increment f(x + h) — f(x) of the 
function caused by the increment h of the argument x. 

Thus df (x) or f’(x) is a linear transformation f’(x): TR” > TR: 

We see that, in complete agreement with the one-dimensional case that we stud- 
ied, a vector-valued function of several variables is differentiable at a point if its 
increment Af (x;/) at that point is linear as a function of 4 up to the correction 
term a(x;h), which is infinitesimal as h — 0 compared to the increment of the 
argument. 


8.2.2 The Differential and Partial Derivatives of a Real- Valued 
Function 


If the vectors f(x +h), f(x), L(x)h, a(x; h) in R” are written in coordinates, 
Eq. (8.21) becomes equivalent to the n equalities 


fixt+h)— fi@=Li@jh+al(xi;h) (G=1,...,n) (8.22) 


between real-valued functions, in which, as follows from relations (8.9) and (8.15) 
of Sect. 8.1, L'(x) : RR” — R are linear functions and a! (x; h) = o(h) as h > 0, 
x+heE, foreveryi=1,...,n. 

Thus we have the following proposition. 
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Proposition 1 A mapping f : E > R" of a set E C R” is differentiable at a point 
x € E that is a limit point of E if and only if the functions f' : E> R(i=1,...,n) 
that define the coordinate representation of the mapping are differentiable at that 
point. 


Since relations (8.21) and (8.22) are equivalent, to find the differential L(x) of 
a mapping f : E — R” it suffices to learn how to find the differentials L'(x) of its 
coordinate functions f': E > R. 

Thus, let us consider a real-valued function f : E > R, defined on a set E Cc R” 
and differentiable at an interior point x € E of that set. We remark that in the future 
we shall mostly be dealing with the case when E is a domain in R”. If x is an 
interior point of E, then for any sufficiently small displacement h from x the point 
x +h will also belong to E,, and consequently will also be in the domain of definition 
of the function f: E > R. 

If we pass to the coordinate notation for the point x = (x!,..., x’"), the vector 
h=(h!,...,h™), and the linear function L(x)h = aj(x)h! +--+ + am(x)h”, then 
the condition 


fath)—-f@a=La)h+o(h) ash->0 (8.23) 
can be rewritten as 
F(x' hl... +h”) — f(xl. x)= 
= ay(x)h! tees tam(x)h™ +o0(h) ash—>0, (8.24) 


where a(x), ...,@m(x) are real numbers connected with the point x. 
We wish to find these numbers. To do this, instead of an arbitrary displacement h 
we consider the special displacement 


hj =h'e; =0-e, +---+0-e;-) thie; +0-ej41) +--- +0: em 


by a vector h; collinear with the vector e; of the basis {e1,..., em} in R”. 
When h = hj, it is obvious that || || = |h"|, and so by (8.24), for h = h; we obtain 


TG nt ae RO ak ene 


=aj(x)h'+o(h') ash' > 0. (8.25) 
This means that if we fix all the variables in the function f(x!,...,x’”) except 
the ith one, the resulting function of the ith variable alone is differentiable at the 


point x!. 
In that way, from (8.25) we find that 


. FOE cng Wh POSES pad E eceg 
a;(x) = lim - . 
i L 
h'> h 


(8.26) 
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Definition 2 The limit (8.26) is called the partial derivative of the function f(x) at 
the point x = (x!,...,x’") with respect to the variable x/. We denote it by one of 
the following symbols: 


of / 
sa Gf), Dif(x), fi). 
x 


Example I If f (u,v) =u? + v? sinu, then 


) 
a flu, v) = sw, v) = 3u? + v’ cosu, 
u 


02 f (u,v) = Lou, v) =2vusinu. 


Example 2 If f(x, y,z) =arctan(xy*) + e%, then 


af y? 
a ’ b} = ’ %: = = 
if, y,Z) aa vue) aaa 
of 2xy 
fa) => —_— => ——7 
af (x,y,z) ay re Tp x2y" 


of . 
BV =— hye. 
Oz 
Thus we have proved the following result. 


Proposition 2 [fa function f : E > R defined ona set E CR" is differentiable at 
an interior point x € E of that set, then the function has a partial derivative at that 
point with respect to each variable, and the differential of the function is uniquely 
determined by these partial derivatives in the form 


df(xsh= F eyn! peer of (x)h™. (8.27) 
Ox ox™ 


Using the convention of summation on an index that appears as both a subscript 
and a superscript, we can write formula (8.27) succinctly: 


df(x)h=9; f(o)h'. (8.28) 


Example 3 If we had known (as we soon will know) that the function f(x, y, z) 
considered in Example 2 is differentiable at the point (0, 1, 0), we could have written 
immediately 


df(0,1,0)h=1-h'+0-h?41-WP=h'4+r7 


and accordingly 


f(h', 1+h?,h°) — £0, 1,0) =df(, 1, 0)h + o(h) 
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or 


arctan(h!(1+h?)*) +e" =1+h' +h +0(h) ash— 0. 


Example 4 For the function x = (x!,...,x’”) > x!, which assigns to the point 
x €R” its ith coordinate, we have 


An! (x;h) = (y +h') —xi=h', 


that is, the increment of this function is itself a linear function in h :h maul h'. Thus, 
As! (x; h) = da! (x)h, and the mapping dsr‘ (x) = dz! turns out to be actually in- 
dependent of x € R” in the sense that dz‘ (x)h =h' at every point x € R”. If we 
write x! (x) instead of 2! (x), we find that dx! (x)h =dx'h =h'. 

Taking this fact and formula (8.28) into account, we can now represent the differ- 
ential of any function as a linear combination of the differentials of the coordinates 
of its argument x € R”. To be specific: 


af 


df (x) = 0; f (x) dx! = dx! of 
oxi 


heer (8.29) 


since for any vector h € TR” we have 


df (x)h = 0; f (x)h! = 0; f (x) dxth. 


8.2.3 Coordinate Representation of the Differential of a Mapping. 
The Jacobi Matrix 


Thus we have found formula (8.27) for the differential of a real-valued function 
f : E — R. But then, by the equivalence of relations (8.21) and (8.22), for any 
mapping f : E — R"” of a set E C R” that is differentiable at an interior point 
x € E, we can write the coordinate representation of the differential d f(x) as 


df! (x)h dj fl (x)h' 
df(x)h= = 
df" (x)h 3; f" (xh 
f@ -- Be) /a 
= a : = lls (8.30) 
af (x) Set SE Gg hm 


ox” 
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Definition 3 The matrix (0; f/(x)) (i =1,...,m, j =1,...,n) of partial deriva- 
tives of the coordinate functions of a given mapping at the point x € E is called the 
Jacobi matrix? or the Jacobian* of the mapping at the point. 

In the case when n = 1, we are simply brought back to formula (8.27), and when 
n=1andm=1, we arrive at the differential of a real-valued function of one real 
variable. 

The equivalence of relations (8.21) and (8.22) and the uniqueness of the differ- 
ential (8.27) of a real-valued function implies the following result. 


Proposition 3 [f a mapping f : E > R" of a set E C R"” is differentiable at an 
interior point x € E, then it has a unique differential df (x) at that point, and the 
coordinate representation of the mapping df (x): TR? > TR’) is given by rela- 
tion (8.30). 


8.2.4 Continuity, Partial Derivatives, and Differentiability 
of a Function at a Point 


We complete our discussion of the concept of differentiability of a function at a point 
by pointing out some connections among the continuity of a function at a point, the 
existence of partial derivatives of the function at that point, and differentiability at 
the point. 

In Sect. 8.1 (relations (8.17) and (8.18)) we established that if L : R’” — R” is 
a linear transformation, then Lh — 0 as h — O. Therefore, one can conclude from 
relation (8.21) that a function that is differentiable at a point is continuous at that 
point, since 


faxth)—f@=L@)hto(h) ash>0,x+he. 


The converse, of course, is not true because, as we know, it fails even in the 
one-dimensional case. 

Thus the relation between continuity and differentiability of a function at a point 
in the multidimensional case is the same as in the one-dimensional case. 

The situation is completely different in regard to the relations between partial 
derivatives and the differential. In the one-dimensional case, that is, in the case of 
a real-valued function of one real variable, the existence of the differential and the 
existence of the derivative for a function at a point are equivalent conditions. For 
functions of several variables, we have shown (Proposition 2) that differentiability of 
a function at an interior point of its domain of definition guarantees the existence of 
a partial derivative with respect to each variable at that point. However, the converse 
is not true. 


3€.G.J. Jacobi (1804-1851) — well-known German mathematician. 


4The term Jacobian is more often applied to the determinant of this matrix (when it is square). 
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Example 5 The function 


0, ifx'!x?=0, 


1 2\ __ 
ae aa if x1x2 £0, 


equals 0 on the coordinate axes and therefore has both partial derivatives at the point 
(0, 0): 


_. fth',0)—f0,0) 0-0 | 
TOO caer hi - Ae hi mi 
_ f(0,h?) — f (0,0) _ 0-0 
oF 0,0) jet h? r—+o A ° 


At the same time, this function is not differentiable at (0, 0), since it is obviously 
discontinuous at that point. 

The function given in Example 5 fails to have one of its partial derivatives at 
points of the coordinate axes different from (0, 0). However, the function 


xy : 2 2 
0, ifx?+y?=0 


(which we encountered in Example 2 of Sect. 7.2) has partial derivatives at all points 
of the plane, but it also is discontinuous at the origin and hence not differentiable 
there. 

Thus the possibility of writing the right-hand side of (8.27) and (8.28) still does 
not guarantee that this expression will represent the differential of the function we 
are considering, since the function may be nondifferentiable. 

This circumstance might have been a serious hindrance to the entire differential 
calculus of functions of several variables, if it had not been determined (as will 
be proved below) that continuity of the partial derivatives at a point is a sufficient 
condition for differentiability of the function at that point. 


8.3 The Basic Laws of Differentiation 


8.3.1 Linearity of the Operation of Differentiation 


Theorem 1 Jf the mappings f,| : E — R” and fo: E — R", defined on a set 
E C R", are differentiable at a point x € E, then a linear combination of them 
(Ay fi tA2 f2) : E > R” is also differentiable at that point, and the following equal- 
ity holds: 


(Ai fi +A2f2)' (x) = (Arf + A2f3) (x). (8.31) 
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Equality (8.31) shows that the operation of differentiation, that is, forming 
the differential of a mapping at a point, is a linear transformation on the vec- 
tor space of mappings f : E — R” that are differentiable at a given point of the 
set E. The left-hand side of (8.31) contains by definition the linear transforma- 
tion (A fi + A2f2)’(x), while the right-hand side contains the linear combination 
(A1 fy + A2f5)(x) of linear transformations f/(x) : R” — R", and fj(x):R”" > 
IR”, which, as we know from Sect. 8.1, is also a linear transformation. Theorem | 
asserts that these mappings are the same. 


Proof 


Oifi tArfaa +h) — Arfo + A2fr)(x) = 
= (Ai fie +h) + Az fox +h)) — (Ar file) + A2 fo(x)) = 
=Ai(fi@ +h) — filx)) + A2( fo +A) — fax)) = 
=i (fi @)h + o(h)) + A2( fA + o(h)) = 
= (Ar fi@) +42f@))h + off). 


If the functions in question are real-valued, the operations of multiplication and 
division (when the denominator is not zero) can also be performed. We have then 
the following theorem. 


Theorem 2 [f the functions f : E > Rand g: E > R, defined ona set E CR", 
are differentiable at the point x € E, then 


a) their product is differentiable at x and 


(Fg) x) = 8) fe) + fg’ @); (8.32) 


b) their quotient is differentiable at x if g(x) #0, and 


(4) w= 1 (g(x) f(x) — fOne'(a)) (8.33) 
eg) gay me 


The proof of this theorem is the same as the proof of the corresponding parts of 
Theorem 1 in Sect. 5.2, so that we shall omit the details. 

Relations (8.31), (8.32), and (8.33) can be rewritten in the other notations for the 
differential. To be specific: 


d(A1 fix) + Ao fa) (x) = Ard fi + Ard fo)(x), 
d(f - g)(x) = g(x) df (x) + f(x) dg(x), 


a(Z)o = eG df (x) — f(x) dg(x)). 
g g?(x) 
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Let us see what these equalities mean in the coordinate representation of the 
mappings. We know that if a mapping yg : E — R” that is differentiable at an interior 
point x of the set E C R” is written in the coordinate form 


g' (xl, ee Xl) 
g(x) = ; 
Pe at 
then the Jacobi matrix 
dig! ang! 
gy (x)= : — : (x) = (419/)(x) 
01g" Ing” 


will correspond to its differential dg(x) : R” — R” at this point. 

For fixed bases in R” and R” the correspondence between linear transformations 
L:R” > R" andm xn matrices is one-to-one, and hence the linear transformation 
L can be identified with the matrix that defines it. 

Even so, we shall as a rule use the symbol f’(x) rather than df (x) to denote the 
Jacobi matrix, since it corresponds better to the traditional distinction between the 
concepts of derivative and differential that holds in the one-dimensional case. 

Thus, by the uniqueness of the differential, at an interior point x of E we obtain 
the following coordinate notation for (8.31), (8.32), and (8.33), denoting the equality 
of the corresponding Jacobi matrices: 


(8: (Ar fe + 2 fZ)) Ox) = (A198: ff] +428) (@) 
(Gi=1,...,m,j=1,...,n), (8.31') 
(a:(f ; g))(x) = g(x); fx)+ f@)digx) G=l1,...,m), (8.32’) 


1 
(«(4))oo =a (g(x); f(x) _ f (x)d¢g(x)) (i=1,...,m). (8.33) 
g g(x) 


It follows from the elementwise equality of these matrices, for example, that 
the partial derivative with respect to the variable x’ of the product of real-valued 
functions f(x!,...,x’) and g(x!,...,x”) should be taken as follows: 


a(f - g) af 
Sah”) = (eh) SG (Hx) + 


PO ci) Pa), 


We note that both this equality and the matrix equalities (8.31’), (8.32’), and 
(8.33’) are obvious consequences of the definition of a partial derivative and the 
usual rules for differentiating real-valued functions of one real variable. However, 
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we know that the existence of partial derivatives may still turn out to be insufficient 
for a function of several variables to be differentiable. For that reason, along with 
the important and completely obvious equalities (8.31’), (8.32'), and (8.33’), the 
assertions about the existence of a differential for the corresponding mapping in 
Theorems | and 2 acquire a particular importance. 

We remark finally that by induction using (8.32) one can obtain the relation 


Afi ---s Fe) = fa fi) AAG) + + At Se) df) 


for the differential of a product (f{ --- f;,) of differentiable real-valued functions. 


8.3.2 Differentiation of a Composition of Mappings (Chain Rule) 


a. The Main Theorem 


Theorem 3 /f the mapping f : X — Y ofa set X CR” intoa set Y CR" is differ- 
entiable at a point x € X, and the mapping f : Y — R* is differentiable at the point 
y = f(x) € Y, then their composition go f : X > Ri is differentiable at x and the 
differential d(g o f): TRY > TR pay) of the composition equals the composition 
dg(y) o df (x) of the differentials 


: z k 
df(x): TR? > TR ay: dg(y): TR, > TRavy): 


The proof of this theorem repeats almost completely the proof of Theorem 2 of 
Sect. 5.2. In order to call attention to a new detail that arises in this case, we shall 
nevertheless carry out the proof again, without going into technical details that have 
already been discussed, however. 


Proof Using the differentiability of the mappings f and g at the points x and y = 
f (x), and also the linearity of the differential g’(x), we can write 


(go f(x +h) —(g0 f)(x) = 
=g(f(«+h)) -8(f@)) = 
=e (f@)(f@ +h) -— f@)) +0(f@+h)- f@)= 
= g'(y)(f(@)h + o(h)) + 0( f(x +h) — f(x) = 
= g'(y)(f'(@)h) + 8/(y)(o(h)) + 0( f & +h) — f(x) = 
=(g'(y) 0 f’(x))h +(x; h), 


where g’(y) o f’(x) is a linear mapping (being a composition of linear mappings), 
and 


a(x; h) = g'(y)(o(h)) +0( fi +h) — f(x). 
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But, as relations (8.17) and (8.18) of Sect. 8.1 show, 


g'(y)(o(h)) =o(h) ash 0, 
f(x th) — f(x) = fh + o(h) = O(h) +0(h) = O(h) ash 0, 
and 
o(f (x +h) — f(x)) =0(O(h)) =o(h) ash 0. 


Consequently, 


a(x:h) =o(h) +0(h) =o(h) ash—>0, 


and the theorem is proved. 


When rewritten in coordinate form, Theorem 3 means that if x is an interior point 
of the set X and 


fl) + On fl) 
f@= "a, ; = (8; f/)(x), 
anf" (x) see Om f" (x) 


and y = f(x) is an interior point of the set Y and 


digh(y) +» dng! (y) 
gS : — = (dj8°)0), 
digk(y) «++ dngkcy) 
then 
(glo f(x) + In(glo f(x) 
(go f)'(x) = me = (8; (g/ 0 f))(x) = 
A(gko f)(x) ++ An(gko f)(x) 
digl(y) -- dngh)\ (aft) + On fl) 
digk(y) +++ dngk(yy J Vf") + Om f"(x) 
= (dj8'(y)- 9; fi («)). 
In the equality 


(d:(g! o f)) (x) = (a8! (f@)) - 4 £7) (8.34) 


summation is understood on the right-hand side with respect to the index j over its 
interval of variation, that is, from 1 ton. 
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In contrast to Eqs. (8.31’), (8.32’), and (8.33’), relation (8.34) is nontrivial even 
in the sense of elementwise equality of the matrices occurring in it. 
Let us now consider some important cases of the theorem just proved. 


b. The Differential and Partial Derivatives of a Composite Real- Valued 
Function 


Let z= g(y!,..., y”) be areal-valued function of the real variables y!,..., y”, each 
of which in turn is a function y/ = fi Ce acige) (j =1,...,n) of the variables 
x!,...,x". Assuming that the functions g and f/ are differentiable (j = 1,...,n), 


Macl 2 f ) (x) of the composition of the mappings f : 


let us find the partial derivative 
X—>Yandg:Y-R. 
According to formula (8.34), in which / = | under the present conditions, we 


find 


0:(g 0 f)(x) = djg(f(x)) - 4 f(x), (8.35) 
or, in notation that shows more detail, 
a(go f) dg ay! dg ay” 
Byes PET ee ig me ee ee g alte 
ox! ox! dy* ox! dy” dx! 


= d18(f(x)) 9; f 1 (x) ++» + Ong ( f(x) 9: f"(). 


c. The Derivative with Respect to a Vector and the Gradient of a Function 
at a Point 


Consider the stationary flow of a liquid or gas in some domain G of R?. The term 
“stationary” means that the velocity of the flow at each point of G does not vary 
with time, although of course it may vary from one point of G to another. Sup- 
pose, for example, f(x) = f(x!,x*,x°) is the pressure in the flow at the point 
x = (x!,x?, x3) © G. If we move about in the flow according to the law x = x(t), 
where ¢ is time, we shall record a pressure (f o x)(t) = f(x(t)) at time ¢. The rate 
of variation of pressure over time along our trajectory is obviously the derivative 
apex) (t) of the function (f o x)(t) with respect to time. Let us find this derivative, 
assuming that f (x!, x7, x) is a differentiable function in the domain G. By the rule 
for differentiating composite functions, we find 


d(f ox) r) ) 
ee 


cot “Fen + “Fev, 836 


where i/(r) = ““(r) (i = 1,2, 3). 
Since the vector (x!, x7, 3) = v(t) is the velocity of our displacement at time t 
and (01 f, 02 f, 03 f)(x) is the coordinate notation for the differential df (x) of the 
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function f at the point x, Eq. (8.36) can also be rewritten as 


d(fo 
oF) =df (x(t)) v(t), (8.37) 


that is, the required quantity is the value of the differential df (x(t)) of the function 
f(x) at the point x(t) evaluated at the velocity vector v(t) of the motion. 
In particular, if we were at the point x9 = x(0) at time ¢ = 0, then 


d(fo 
a” ©) = df (xo) v, (8.38) 


where v = v(0) is the velocity vector at time t = 0. 

The right-hand side of (8.38) depends only on the point x9 € G and the velocity 
vector v that we have at that point; it is independent of the specific form of the 
trajectory x = x(t), provided the condition x(0) = v holds. That means that the 
value of the left-hand side of Eq. (8.38) is the same on any trajectory of the form 


x(t)=xo9 + vt+ a(t), (8.39) 


where a(t) = o(t) as t > 0, since this value is completely determined by giving the 
point xo and the vector v € TR}, attached at that point. In particular, if we wished 
to compute the value of the left-hand side of Eq. (8.38) directly (and hence also the 
right-hand side), we could choose the law of motion to be 


X(t) = x0 + uf, (8.40) 
corresponding to a uniform motion at velocity v under which we are at the point 


x(0) = xo at time t = 0. 
We now give the following 


Definition 1 If the function f(x) is defined in a neighborhood of the point x9 € R” 
and the vector v € TRY, is attached at the point xo, then the quantity 


fo + a — f (xo) (8.41) 


Dy f (Xo) := lim 


(if the indicated limit exists) is called the derivative of f at the point xo with respect 
to the vector v or the derivative along the vector v at the point xo. 


It follows from these considerations that if the function f is differentiable at the 
point xo, then the following equality holds for any function x(t) of the form (8.39), 
and in particular, for any function of the form (8.40): 


d(f ox) 
dt 


Dy f (x0) = (0) =df (xo) v. (8.42) 
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In coordinate notation, this equality says 


ul (xo)v™. (8.43) 


ox™ 


) 
Dy f (xo) = oF (xo)u! feet 


In particular, for the basis vectors e; = (1,0,...,0),...,é@n = (0,...,0, 1) this 
formula implies 


0 
De; f (Xo) = (x0) G=1,...,m). 


By virtue of the linearity of the differential df (xo), we deduce from Eq. (8.42) 
that if f is differentiable at the point xo, then for any vectors v1, v2 € TRY and any 
A 1,A2 € R the function has a derivative at the point x9 with respect to the vector 
(Ayv, + A20v2) € TRY and that 


Dyyvi tov. f X0) = A1 Do, f (Xo) + A2D v2 f (X0)- (8.44) 


If R” is regarded as a Euclidean space, that is, as a vector space with an inner 
product, then (see Sect. 8.1) it is possible to write any linear functional L(v) as the 
inner product (&, v) of a fixed vector § = &(L) and the variable vector v. 

In particular, there exists a vector € such that 


df (xo)u = (&, v). (8.45) 


Definition 2 The vector § ¢ TRY, corresponding to the differential df (xo) of the 
function f at the point xo in the sense of Eq. (8.45) is called the gradient of the 
function at that point and is denoted grad f (xo). 

Thus, by definition 


df (xo)v = (grad Ff (xo); v). (8.46) 


If a Cartesian coordinate system has been chosen in R”, then, by comparing 
relations (8.42), (8.43), and (8.46), we conclude that the gradient has the following 
representation in such a coordinate system: 


of of 
grad Ff (xo) = (+. Lean ao) (xo). (8.47) 
We shall now explain the geometric meaning of the vector grad f (xo). 

Let e € TRY, be a unit vector. Then by (8.46) 


De f (xo) = |grad f (xo)| cos, (8.48) 


where ¢ is the angle between the vectors e and grad f (xo). 

Thus if grad f(xo) # 0 and e = || grad f (xo)||~! grad f (xo), the derivative 
De f (xo) assumes a maximum value. That is, the rate of increase of the function f 
(expressed in the units of f relative to a unit length in R”) is maximal and equal 
to || grad f(xo)|| for motion from the point xo precisely when the displacement is 
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in the direction of the vector grad f (xo). The value of the function decreases most 
sharply under displacement in the opposite direction, and the rate of variation of the 
function is zero in a direction perpendicular to the vector grad f (x). 

The derivative with respect to a unit vector in a given direction is usually called 
the directional derivative in that direction. 

Since a unit vector in Euclidean space is determined by its direction cosines 


e€ = (COSQ],...,COSQAm), 


where a; is the angle between the vector e and the basis vector e; in a Cartesian 
coordinate system, it follows that 


Def (xo) = (grad f (x0), e) = “Too) cosa +++++ Ho) COS Om. 

The vector grad f (xo) is encountered very frequently and has numerous applica- 
tions. For example the so-called gradient methods for finding extrema of functions 
of several variables numerically (using a computer) are based on the geometric prop- 
erty of the gradient just noted. (In this connection, see Problem 2 at the end of this 
section.) 

Many important vector fields, such as, for example, a Newtonian gravitational 
field or the electric field due to charge, are the gradients of certain scalar-valued 
functions, known as the potentials of the fields (see Problem 3). 

Many physical laws use the vector grad f in their very statement. For example, 
in the mechanics of continuous media the equivalent of Newton’s basic law of dy- 
namics ma = F is the relation 


pa= -— grad p, 


which connects the acceleration a = a(x, f) in the flow of an ideal liquid or gas 
free of external forces at the point x and time ¢ with the density of the medium 
p = e(x,t) and the gradient of the pressure p = p(x, t) at the same point and time 
(see Problem 4). 

We shall discuss the vector grad f again later, when we study vector analysis and 
the elements of field theory. 


8.3.3 Differentiation of an Inverse Mapping 


Theorem 4 Let f : U(x) > V(y) be a mapping of a neighborhood U(x) C R” of 
the point x onto a neighborhood V(y) C R” of the point y = f (x). Assume that f 
is continuous at the point x and has an inverse mapping f~! : V(y) > U(x) that 
is continuous at the point y. 

Given these assumptions, if the mapping f is differentiable at x and the tan- 
gent mapping f'(x): TR" > TR® to f at the point x has an inverse ror: 
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TRY — TR", then the mapping f7!:V(y) > U(x) is differentiable at the point 
y = f (x), and the following equality holds: 


GG Yosl[ro] |. 


Thus, mutually inverse differentiable mappings have mutually inverse tangent 
mappings at corresponding points. 


Proof We use the following notation: 
f@=y, f@eth=ytt, t=fa+h)—f@), 
so that 
fl@sx, f'lotp=xth, haf 'y+o—-f 0). 


We shall assume that / is so small that x + h € U(x), and hence y+t € V(y). 
It follows from the continuity of f at x and f~! at y that 


t=f(x+h)-—f(«x)-0 ash-0 (8.49) 
and 
h=f'~+n—f7'(y>0 ast>0. (8.50) 
It follows from the differentiability of f at x that 
t= f'(x)ht+o(h) ash—0O, (8.51) 


that is, we can even assert that t = O(h) as h > 0 (see relations (8.17) and (8.18) 
of Sect. 8.1). 

We shall show that if f’(x) is an invertible linear mapping, then we also have 
h=O(t)ast— 0. 

Indeed, we find successively by (8.51) that 


[fo] e=nt[f@] loth) ash 0, 


[P@] lt =htoh) oer 
- (8.52) 
ILC] | = Wall — Jom] ash — 0, 
IL] = Anal for |||] <5, 


where the number 6 > 0 is chosen so that ||o(h)|| < 5 llAll when ||| < 6. Then, 
taking account of (8.50), that is, the relation h > 0 as t > 0, we find 


all <2 [fo] ‘el = O(itl) ast +0, 
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which is equivalent to 
h=O(t) ast—0. 


From this it follows in particular that 
o(h)=o(t) ast—0. 
Taking this relation into account, we find by (8.50) and (8.52) that 
h=[f'@o] ‘t+0@) ast 0 


or 


floto-—fIlM=[f@]tt+0® ast 0. 


It is known from algebra that if the matrix A corresponds to the linear trans- 
formation L : R’” — R’”, then the matrix A~! inverse to A corresponds to the lin- 
ear transformation L~! : R’” —> R” inverse to L. The construction of the elements 
of the inverse matrix is also known from algebra. Consequently, the theorem just 
proved provides a direct recipe for constructing the mapping (f~!)'(y). 

We remark that when m = 1, that is, when R” = R, the Jacobian of the mapping 
f : U(x) > V(y) at the point x reduces to the single number f’(x) — the deriva- 
tive of the function f at x — and the linear transformation f’(x) : TR, > TRy 
reduces to multiplication by that number: h +> f’(x)h. This linear transforma- 
tion is invertible if and only if f’(x) 4 0, and the matrix of the inverse mapping 
[feo : TRy > TR, also consists of a single number, equal to aes ae that 
is, the reciprocal of f’(x). Hence Theorem 4 also subsumes the rule for finding the 
derivative of an inverse function proved earlier. 


8.3.4 Problems and Exercises 


1. a) We shall regard two paths th x(t) and tt x2(f) as equivalent at the point 
xo € R” if x1 (0) = x2(0) = xo and d(x, (ft), x2(t)) = o(t) as t > 0. 

Verify that this relation is an equivalence relation, that is, it is reflexive, symmet- 
ric, and transitive. 

b) Verify that there is a one-to-one correspondence between vectors v € TRY) 
and equivalence classes of smooth paths at the point xo. 

c) By identifying the tangent space TRY, with the set of equivalence classes of 
smooth paths at the point x9 € IR”, introduce the operations of addition and multi- 
plication by a scalar for equivalence classes of paths. 

d) Determine whether the operations you have introduced depend on the coor- 
dinate system used in R”. 
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2. a) Draw the graph of the function z = x* + 4y*, where (x, y, z) are Cartesian 
coordinates in R?. 

b) Let f : G > R be a numerically valued function defined on a domain G C 
R”. A level set (c-level) of the function is a set E C G on which the function 
assumes only one value (f(E) =c). More precisely, E = f —!(c). Draw the level 
sets in R* for the function given in part a). 

c) Find the gradient of the function f(x, y) = x* +4y?, and verify that at any 
point (x, y) the vector grad f is orthogonal to the level curve of the function f 
passing through the point. 

d) Using the results of a), b), and c), lay out what appears to be the shortest path 
on the surface z = x” + 4y* descending from the point (2, 1, 8) to the lowest point 
on the surface (0, 0, 0). 

e) What algorithm, suitable for implementation on a computer, would you pro- 
pose for finding the minimum of the function f (x, y) = x* + 4y?? 


3. We say that a vector field is defined in a domain G of R” if a vector v(x) € TRY 
is assigned to each point x € G. A vector field v(x) in G is called a potential field 
if there is a numerical-valued function U : G > R such that v(x) = grad U(x). The 
function U(x) is called the potential of the field v(x). (In physics it is the function 
—U (x) that is usually called the potential, and the function U(x) is called the force 
function when a field of force is being discussed.) 


a) On a plane with Cartesian coordinates (x, y) draw the field grad f(x, y) 
for each of the following functions: f\(x, y) =x? + y7; fo(x, y) = —(x? + y?); 
J3(x, y) = arctan(x/y) in the domain y > 0; f4(x, y) = xy. 

b) By Newton’s law a particle of mass m at the point 0 € R? attracts a particle of 
mass | at the point x € R* (x 4 0) with force F = —m|r|~*r, where r is the vector 
Ox (we have omitted the dimensional constant Go). Show that the vector field F(x) 
in R3\0 is a potential field. 

c) Verify that masses m; (i = 1,...,7) located at the points (&, 7;,¢) @ = 
1,...,7) respectively, create a Newtonian force field except at these points and that 
the potential is the function 


n 


U@yo=) m 
“ =I V@—&P + 0—mP2+@—G) 


d) Find the potential of the electrostatic field created by point charges qg; (i = 
1,...,7) located at the points (&;, n;, ¢;) (i = 1, ...,) respectively. 


4. Consider the motion of an ideal incompressible liquid in a space free of external 
forces (in particular, free of gravitational forces). 

Let v= v(x, y,z,t),a=a(x,y,Z,t), p= p(x, y,z,t), and p= p(x, y,z,t) be 
respectively the velocity, acceleration, density, and pressure of the fluid at the point 
(x, y, z) of the medium at time f. 

An ideal liquid is one in which the pressure is the same in all directions at each 
point. 
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a) Distinguish a volume of the liquid in the form of a small parallelepiped, one 
of whose edges is parallel to the vector grad p(x, y, z,t) (where grad p is taken 
with respect to the spatial coordinates). Estimate the force acting on this volume 
due to the pressure drop, and give an approximate formula for the acceleration of 
the volume, assuming the fluid is incompressible. 

b) Determine whether the result you obtained in a) is consistent with Euler’s 
equation 


pa= — grad p. 


c) A curve whose tangent at each point has the direction of the velocity vector 
at that point is called a streamline. The motion is called stationary if the functions 
v, a, e, and p are independent of fr. Using b), show that along a streamline in the 
stationary flow of an incompressible liquid the quantity sivil + p/p is constant 
(Bernoulli’s law”). 

d) How do the formulas in a) and b) change if the motion takes place in the 
gravitational field near the surface of the earth? Show that in this case 


pa=-— grad(gz+ p). 


so that now the quantity 5 I|v\? + ¢z + p/p is constant along each streamline of the 
stationary motion of an incompressible liquid, where g is the gravitational acceler- 
ation and z is the height of the streamline measured from some zero level. 

e) Explain, on the basis of the preceding results, why a load-bearing wing has a 
characteristic convex-upward profile. 

f) An incompressible ideal liquid of density p was used to fill a cylindrical glass 
with a circular base of radius R to a depth h. The glass was then revolved about its 
axis with angular velocity w. Using the incompressibility of the liquid, find the equa- 
tion z= f(x, y) of its surface in stationary mode (see also Problem 3 of Sect. 5.1). 

g) From the equation z = f(x, y) found in part f) for the surface, write a formula 
p = p(x, y,z) for the pressure at each point (x, y, z) of the volume filled by the 
rotating liquid. Check to see whether the equation pa = — grad(gz + p) of part d) 
holds for the formula that you found. 

h) Can you now explain why tea leaves sink (although not very rapidly!) and 
why they accumulate at the center of the bottom of the cup, rather than its side, 
when the tea is stirred? 


5. Estimating the errors in computing the values of a function. 


a) Using the definition of a differentiable function and the approximate equality 
Af (x; h) + df (x)h, show that the relative error 6 = 6( f(x); h) in the value of 
the product f(x) = x!---x" of m nonzero factors due to errors in determining the 
factors themselves can be found in the form 6 ~ ys , i, Where 4; is the relative 
error in the determination of the ith factor. 


Daniel Bernoulli (1700-1782) — Swiss scholar, one of the outstanding physicists and mathemati- 
cians of his time. 
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b) Using the equality din f(x) = 4. df(x), obtain the result of part a) again 


f(x) 
and show that in general the relative error in a fraction 
fish 
=" (x1, $828 § Xm) 
§1°°* 8k 


can be found as the sum of the relative errors of the values of the functions 
Fis eees fins Bl 000s ke 


6. Homogeneous functions and Euler’s identity. A function f : G —> R defined 
in some domain G C R” is called homogeneous (resp. positive-homogeneous) of 
degree n if the equality 


fAx) =A" f(x) (resp. f (Ax) = |Al" f(x) 


holds for any x € R” and A € R such that x € G and Ax EG. 
A function is locally homogeneous of degree n in the domain G if it is a homo- 
geneous function of degree n in some neighborhood of each point of G. 


a) Prove that in a convex domain every locally homogeneous function is homo- 
geneous. 

b) Let G be the plane R? with the ray L = {(x, y) € R* | x =2 A y > 0} re- 
moved. Verify that the function 


4 . 
y'/x, ifx>2Ay>0, 
fa, y= 4°, 


yy at other points of the domain, 


is locally homogeneous in G, but is not a homogeneous function in that domain. 
c) Determine the degree of homogeneity or positive homogeneity of the follow- 
ing functions with their natural domains of definition: 


Fi(at see. 8™) axl? x28 pe pl: 
1,.2 3,4 
1.2.3 4) XO +X . 
falel x7, xx ) = Sa eas 
I 
Pais) S eee 


d) By differentiating the equality f (tx) = t” f(x) with respect to t, show that if 
a differentiable function f : G — R is locally homogeneous of degree n in a domain 
G CR", it satisfies the following Euler identity for homogeneous functions: 


0 0 
Pe Glnce”) feta ae Ceo) =nf(x',...,x™). 
e) Show that if Euler’s identity holds for a differentiable function f : G— Rin 
a domain G, then that function is locally homogeneous of degree n in G. 
Hint: Verify that the function g(t) = t~” f (tx) is defined for every x € G and is 
constant in some neighborhood of 1. 
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7. Homogeneous functions and the dimension method. 

1°. The dimension of a physical quantity and the properties of functional relations 
between physical quantities. 

Physical laws establish interconnections between physical quantities, so that if 
certain units of measurement are adopted for some of these quantities, then the units 
of measurement of the quantities connected with them can be expressed in a certain 
way in terms of the units of measurement of the fixed quantities. That is how the 
basic and derived units of different systems of measurement arise. 

In the International System, the basic mechanical units of measurement are taken 
to be the unit of length (the meter, denoted m), mass (the kilogram, denoted kg), and 
time (the second, denoted s). 

The expression of a derived unit of measurement in terms of the basic mechanical 
units is called its dimension. This definition will be made more precise below. 

The dimension of any mechanical quantity is written symbolically as a formula 
expressing it in terms of the symbols L, M, and T proposed by Maxwell® as the 
dimensions of the basic units mentioned above. For example, the dimensions of 
velocity, acceleration, and force have respectively the forms 


[v] =LT!, [a] = LT~?, [F]=MLT~. 


If physical laws are to be independent of the choice of units of measurement, one 
expression of that invariance should be certain properties of the functional relation 


Xo = f(X1, 6+ Xky MEL, ++ Xn) (*) 


between the numerical characteristics of the physical quantities. 

Consider, for example, the relation c = f(a, b) = Va? + b* between the lengths 
of the legs and the length of the hypotenuse of a right triangle. Any change of scale 
should affect all the lengths equally, so that for all admissible values of a and b the 
relation f (aa, ab) = y(a) f (a, b) should hold, and in the present case g(a) =a. 

A basic (and, at first sight, obvious) presupposition of dimension theory is that a 
relation (*) claiming physical significance must be such that when the scales of the 
basic units of measurement are changed, the numerical values of all terms of the 
same type occurring in the formula must be multiplied by the same factor. 

In particular, if x1, x2,x3 are basic independent physical quantities and the re- 
lation (x1, x2,x3)  f(%1,x2,x3) expresses the way a fourth physical quantity 
depends on them, then, by the principle just stated, for any admissible values of 
X1,%X2, x3 the equality 


SF (1X1, 2X2, 3X3) = (A, H2)013) f (x2, X2, x3), (*) 


must hold with some particular function 9g. 


®J.C. Maxwell (1831-1879) — outstanding British physicist. He created the mathematical theory of 
the electromagnetic field, and is also famous for his research in the kinetic theory of gases, optics, 
and mechanics. 
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The function ¢ in (**) characterizes completely the dependence of the numerical 
value of the physical quantity in question on a change in the scale of the basic fixed 
physical quantities. Thus, this function should be regarded as the dimension of that 
physical quantity relative to the fixed basic units of measurement. 

We now make the form of the dimension function more precise. 


a) Letx } f(x) bea function of one variable satisfying the condition f (ax) = 
g(a) f(x), where f and ¢ are differentiable functions. 

Show that g(a) =a. 

b) Show that the dimension function ¢ in Eq. (**) always has the form of : of, 
a}, where the exponents d,, dz, d3 are certain real numbers. Thus if, for exam- 
ple, the basic units of L, M, and T are fixed, then the set (d,, dz, d3) of exponents 
expressed in the power representation L“1 M“”T% can also be regarded as the di- 
mension of the given physical quantity. 

c) In part b) it was found that the dimension function is always a power function, 
that is, it is a homogeneous function of a certain degree with respect to each of the 
basic units of measurement. What does it mean if the degree of homogeneity of the 
dimension function of a certain physical quantity relative to one of the basic units 
of measurement is zero? 

2° The I1-theorem and the dimension method. 

Let [x;] = X; @ =0,1,...,7) be the dimensions of the physical quantities oc- 
curring in the law (*). 

Assume that the dimensions of xo, xx41, ..-,X, can be expressed in terms of the 
dimensions of x,,..., x;, that is, 


1 k 
P P 
[xo] = Xo = X}°--- X;,°, 


ee 
[epi] = Xap = Xp) XP G=1,...,2—b). 


d) Show that the following relation must then hold, along with (*): 


if k 1 k 1 k 
Po Po Pp Pp Ph—k Pn—k 
Oy +O X= f (ext. 30pm, oF" 0 Kelpie, 04” ea,” tn), 
(***) 
e) Ifx,,...,x% are independent, we set a; = ces 1 A= x in (***), Verify 
that when this is done, (***) yields the equality 
XO Xk+1 Xn 
PO an ate PI Pi Pak pe, 
it nh— 
mat Xp Xp Xp xy x 
which is a relation 
T= f(,...,1,1h,...,ln-k) (SPEE) 


involving the dimensionless quantities IT, IT, ..., Tn—x. 
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Thus we obtain the following 


IT-theorem of dimension theory /f the quantities x,,...,xx in relation (*) are 
independent, this relation can be reduced to the function (****) of n — k dimen- 
sionless parameters. 


f) Verify that if k =n, the function f in relation (*) can be determined up to a 
numerical multiple by using the [7-theorem. Use this method to find the expression 
c(%9)/1/g for the period of oscillation of a pendulum (that is, a mass m suspended 
by a thread of length / and oscillating near the surface of the earth, where go is the 
initial displacement angle). 

g) Find a formula P = c./mr/F for the period of revolution of a body of mass 
m held in a circular orbit by a central force of magnitude F’. 

h) Use Kepler’s law (P;/ Pp)? = (ry / r2)°, which establishes for circular orbits 
a connection between the ratio of the periods of revolution of planets (or satellites) 
and the ratio of the radii of their orbits, to find, as Newton did, the exponent @ in 
the law of universal gravitation F = G22. 


8.4 The Basic Facts of Differential Calculus of Real- Valued 
Functions of Several Variables 


8.4.1 The Mean-Value Theorem 


Theorem 1 Let f :G— R be a real-valued function defined in a region G C R", 
and let the closed line segment [x,x + h] with endpoints x and x + h be con- 
tained in G. If the function f is continuous at the points of the closed line segment 
[x,x + h] and differentiable at points of the open interval |x,x + h[, then there 
exists a point € € |x, x + h[ such that the following equality holds: 


fe +h)— fx) = f'n. (8.53) 


Proof Consider the auxiliary function 
F(t)= f(x +th) 


defined on the closed interval 0 < t < 1. This function satisfies all the hypotheses of 
Lagrange’s theorem: it is continuous on [0, 1], being the composition of continuous 
mappings, and differentiable on the open interval JO, 1[, being the composition of 
differentiable mappings. Consequently, there exists a point 6 € JO, 1[ such that 


F(1) — F(0) = F'() «1. 


But F(1)= f(x +h), F(O) = f(x), F’(0) = f' (x + 6A)h, and hence the equal- 
ity just written is the same as the assertion of the theorem. 
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We now give the coordinate form of relation (8.53). 
If x = (x!,...,x"), h = (h!,...,h™), and € = (x! + 0n!,..., x" + Oh”), 
Eq. (8.53) means that 


fath)—f@)=f(xital,...j<" +h”) — f(t. a") = 


h! 
a a 
=f@n=(F4... =) 2g 
X 


“9 
ox™ hm 


=O f(E\A +--+: + On f(E)A™ = 


m 
= are’ +0h',...,x" +0h™)h'. 
i=l 


Using the convention of summation on an index that appears as both superscript 
and subscript, we can finally write 


f(x t+hl,...x" +h) — f(xt,.. 2") = 
= 0 f(x! +0h',...,.x"+0h™)h', (8.54) 


where 0 < @ < | and @ depends on both x and h. 


Remark Theorem | is called the mean-value theorem because there exists a certain 
“average” point € € |x, x + h[ at which Eq. (8.53) holds. We have already noted in 
our discussion of Lagrange’s theorem (Sect. 5.3.1) that the mean-value theorem is 
specific to real-valued functions. A general finite-increment theorem for mappings 
will be proved in Chap. 10 (Part 2). 

The following proposition is a useful corollary of Theorem 1. 


Corollary [f the function f :G— R is differentiable in the domain G C R™ and 
its differential equals zero at every point x € G, then f is constant in the domain G. 


Proof The vanishing of a linear transformation is equivalent to the vanishing of all 
the elements of the matrix corresponding to it. In the present case 


df )h = (01 f,---5 Om /Y()h, 


and therefore 0; f(x) =--- = 0, f(x) = 0 at every point x € G. 
By definition, a domain is an open connected set. We shall make use of this fact. 
We first show that if x € G, then the function f is constant in a ball B(x; r) C G. 
Indeed, if (x +h) € B(x; r), then [x,x+h] C B(x; r) C G. Applying relation (8.53) 
or (8.54), we obtain 


fx+h)— fx)= f'E)h=0-h=0, 
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that is, f(x + h) = f(x), and the values of f in the ball B(x; r) are all equal to the 
value at the center of the ball. 

Now let x9, x1 € G be arbitrary points of the domain G. By the connectedness 
of G, there exists a path th x(t) € G such that x(0) = x9 and x(1) = x1. We 
assume that the continuous mapping f +> x(t) is defined on the closed interval 0 < 
t < 1. Let B(xo; r) be a ball with center at x9 contained in G. Since x(0) = xo and 
the mapping t +> x(t) is continuous, there is a positive number 6 such that x(t) € 
B(xo;r) C G for 0 <t <6. Then, by what has been proved, (f o x)(t) = f(xo) on 
the interval [0, 6]. 

Let / = sup 6, where the upper bound is taken over all numbers 6 € [0, 1] such that 
(f ox)(t) = f (xo) on the interval [0, 5]. By the continuity of the function f (x(t)) 
we have f(x(/)) = f (xo). But then / = 1. Indeed, if that were not so, we could take 
aball B(x(J); r) C G, in which f(x) = f(x()) = f (xo), and then by the continuity 
of the mapping t > x(t) find A > 0 such that x(t) € Bx); r) for] <t</+A. 
But then (f ox)(t) = f(x()) = f(xo) for0 <t <1+ A, andsol ¢supé. 

Thus we have shown that (f o x)(t) = f(xo) for any ¢ € [0, 1]. In particular 
(f ox)C) = f(x1) = f (xo), and we have verified that the values of the function 
f :G-— R are the same at any two points xo, x1 € G. 


8.4.2 A Sufficient Condition for Differentiability of a Function 
of Several Variables 


Theorem 2 Let f : U(x) > R be a function defined in a neighborhood U(x) C R™ 
of the point x =(x!,...,x"). 

If the function f has all partial derivatives ar, bites af at each point of the 
neighborhood U(x) and they are continuous at x, then f is differentiable at x. 


Proof Without loss of generality we shall assume that U (x) is a ball B(x; r). Then, 
together with the points x = (xt, oo.) and x +h= (x! +h! x +h), 
the points (x!,4? 42,000,207 + A) wc |e bh”) and the 
lines connecting them must also belong to the domain U(x). We shall use this fact, 
applying the Lagrange theorem for functions of one variable in the following com- 
putation: 


fe th)—fx)= f(x thle th”) — f (xt... x™) = 
= f(x! +hly..x™ th”) — f(x xr $n’... x +h) + 
+ f(xtx? +h, x" +h”) — 
— f(x x tA. Fh) pet 


+ f (xix?) a +h”) = fl x™) = 
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=a f(x! +0'nl x? +h7,....x"™+h™)hl + 
Hf (x x7 $O07n, HH, EA)? +e $ 
ae Gn f (x, x’, _ (xm d yn +e™h™)h™, 


So far we have used only the fact that the function f has partial derivatives with 
respect to each of its variables in the domain U(x). 

We now use the fact that these partial derivatives are continuous at x. Continuing 
the preceding computation, we obtain 


fx th)— f(x) =a f(x',...,x™)hi +atalt 
+ dof (x!,...,2™)h? + 07h? +.--4+ 
+ mf (x',....x™)h™ +a™h”™, 


where the quantities a1,...,@m tend to zero as h > 0 by virtue of the continuity of 
the partial derivatives at the point x. 
But this means that 


fath—-f@=La)h+o(h) ash—0, 


where L(x)h =, f(x!,...,x™) hi +---+ On f(x',...,x™h™. 


It follows from Theorem 2 that if the partial derivatives of a function f:G— R 
are continuous in the domain G C R’”, then the function is differentiable at that 
point of the domain. 

Let us agree from now on to use the symbol C“!)(G; R), or, more simply, C“ (G) 
to denote the set of functions having continuous partial derivatives in the domain G. 


8.4.3. Higher-Order Partial Derivatives 


If a function f :G — R defined in a domain G C R” has a partial derivative JF (x) 
with respect to one of the variables x oo. x, this partial derivative is a function 
0; f : G > R, which in turn may have a partial derivative 0; (0; f)(x) with respect 
to a variable x/. 

The function 0; (0; f) : G — R is called the second partial derivative of f with 
respect to the variables x' and x/ and is denoted by one of the following symbols: 


oy 
dxJax! 


dji f (x), (x). 

The order of the indices indicates the order in which the differentiation is carried 
out with respect to the corresponding variables. 

We have now defined partial derivatives of second order. 
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If a partial derivative of order k 


ak f 


Oxil --- Oxtk 


0j,---i, f (%) = (x) 


has been defined, we define by induction the partial derivative of order k + 1 by the 
relation 


Oji,--i, f (X) = 0; (Oj.--ig Ay). 


At this point a question arises that is specific for functions of several variables: 
Does the order of differentiation affect the partial derivative computed? 


Theorem 3 [f the function f :G — R has partial derivatives 


af a f 


axtdxi~ axlax! 


(x) 


in a domain G, then at every point x € G at which both partial derivatives are 
continuous, their values are the same. 


Proof Let x € G be a point at which both functions 0;; f : G > R and 0;; f: G > 
R are continuous. From this point on all of our arguments are carried out in the 
context of a ball B(x; r) C G,r > 0, which is a convex neighborhood of the point x. 
We wish to verify that 


a? a? 
NED, 3 gy OTN i), 
Ox! dxJ Ox/ dx! 

Since only the variables x’ and x/ will be changing in the computations to fol- 
low, we shall assume for the sake of brevity that f is a function of two variables 
f(x!, x’), and we need to verify that 


a f (c!,x2) = a° f (x! 2), 


axtax? 


if the two functions are both continuous at the point (x to x’). 
Consider the auxiliary function 


F(h',h?) = f(x! +h, x? +h’) _ f(x! +h', x?) _ f(x',x? +h’) + Fao Ns 
where the displacement h = (h!, h*) is assumed to be sufficiently small, namely so 
small that x +h € B(x;r). 

If we regard F (h!, h”) as the difference 


F(h',h?) =90) — 90), 
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where g(t) = Fe +th!, x? +h?) — Fe + th!, x2), we find by Lagrange’s the- 
orem that 


F(h',h?) =9'(01) = (A f(x! + O1h', x* +h?) — af (x! +Oh', x7))h'. 
Again applying Lagrange’s theorem to this last difference, we find that 
F(h',h?) = do) f(x! + 0)h', x? + Oh7)h7h!. (8.55) 
If we now represent F (h!, h?) as the difference 
F(h',h?) = $01) — 6), 
where G(t) = f(x! +h!, x? + th?) — f(x!, x? + th?), we find similarly that 
F(a! h?) = dif (x! +O, h!, x7 + Oh7)ath?. (8.56) 
Comparing (8.55) and (8.56), we conclude that 
Ooi f (x! + 01h, x + 02h") = O12 f (x! + O1h', x* + 627), (8.57) 


where 61, 02, 61, O> € JO, 1[. Using the continuity of the partial derivatives at the 
point (x!, x), as h > 0, we get the equality we need as a consequence of (8.57). 


doi f(x", x?) = dof (x!, x). 


We remark that without additional assumptions we cannot say in general that 
dij f (x) = 0;i f (x) if both of the partial derivatives are defined at the point x (see 
Problem 2 at the end of this section). 

Let us agree to denote the set of functions f : G > R all of whose partial deriva- 
tives up to order k inclusive are defined and continuous in the domain G Cc R” by 
the symbol C“(G; R) or C“(G). 

As a corollary of Theorem 3, we obtain the following. 


Proposition 1 /f f ¢ C“(G;R), the value Oi,...ig f (x) Of the partial derivative is 
independent of the order i,,..., i, of differentiation, that is, remains the same for 
any permutation of the indices i,,..., ix. 


Proof 1n the case k = 2 this proposition is contained in Theorem 3. 

Let us assume that the proposition holds up to order n inclusive. We shall show 
that then it also holds for order n + 1. 

But 0), i9..-i,41 f (%) = 0i, (0i9-.-i,,; f) (*). By the induction assumption the indices 
iz, ...,in41 can be permuted without changing the function 9j,...;,,,, f(x), and hence 
without changing 0j,...;,,,; f(%). For that reason it suffices to verify that one can 
also permute, for example, the indices i; and iz without changing the value of the 
derivative 0j,i5...i,4, f(). 
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Since 


Diqining F(X) = 9,47 Oi5---1,4, A), 


the possibility of this permutation follows immediately from Theorem 3. By the 
induction principle Proposition | is proved. 


Example 1 Let f(x) = f (x!, x?) be a function of class C“(G; R). 
Let h = (h!, h”) be such that the closed interval [x,x + A] is contained in the 
domain G. We shall show that the function 


g(t) = f(x + th), 


which is defined on the closed interval [0, 1], belongs to class C ) [0, 1] and find its 
derivative of order k with respect to f. 
We have 


g(t) =a f(x! +th!, x? + th?)h! + a f(x! +th!, x? +1h?)n’, 
g(t) = a1 f(x +thyh'h! + do) f(x + thyhrh! + 
+ df (e+ thyh'h? + doo f (x + th)hrh? = 
= du f(etthy(h')? +209 fx + thyh lh? + 09 f(x + th) (I). 
These relations can be written as the action of the operator (h!d; +h?a9): 
g(t) = (h' a, +h? a) f(x + th) =h'd; f (x +th), 
yg" (t) = (h'a; + hao)” f(x + th) =h''h2; i, f(x + th). 
By induction we obtain 
pe 1) = (hd +72) fe + th) sh! hl, iy FO + th) 


(summation over all sets i1,..., ix of k indices, each assuming the values | and 2, 
is meant). 


Example 2 If f(x) = f(x!,...,x) and f ¢ C®(G; R), then, under the assump- 
tion that [x, x +h] Cc G, for the function g(t) = f(x + th) defined on the closed 
interval [0, 1] we obtain 


g(t) =hi--- WG; i, f (x + th), (8.58) 
where summation over all sets of indices i;,..., 7%, each assuming all values from 


1 to m inclusive, is meant on the right. 
We can also write formula (8.58) as 


g(t) = (hla, +e +H In)” fa + th). (8.59) 
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8.4.4 Taylor’s Formula 


Theorem 4 /f the function f : U(x) > R is defined and belongs to class C™ (U(x); 
R) in a neighborhood U(x) C R” of the point x € R”, and the closed interval 
[x, x +h] is completely contained in U(x), then the following equality holds: 


F(xi +aly.. x +h”) — f(xl. x)= 
“(181 th am) fe) + rn—1 0% A), (8.60) 


where 


Gao" mg 
rn 103 n= [& EL (n'a, +++» +A"m)" f(x +th)dt. (8.61) 


Equality (8.60), together with (8.61), is called Taylor’s formula with integral 
form of the remainder. 


Proof Taylor’s formula follows immediately from the corresponding Taylor formula 
for a function of one variable. In fact, consider the auxiliary function 


p(t) = f(x + th), 


which, by the hypotheses of Theorem 4, is defined on the closed interval 0 < t < 1 
and (as we have verified above) belongs to the class C 10, 1]. 

Then for t € [0, 1], by Taylor’s formula for functions of one variable, we can 
write that 


1 1 
g(t) =9O + BP'Ort+ + ——o* VO" + 


(n—1)! 
d-1" 7 2 we go” n 
+f < “GDL (tt)t” dt. 
Setting t = | here, we obtain 
o(1) = 9) + =o!) +--+ "PO + 

1! (n—1)! 
aor" _ tyr} ‘ 

+f & “Gap? g(t) dt. (8.62) 


Substituting the values 


v0) = (n'a, feet h™ In)’ fx) (k=0,...,n—1), 
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p(t) = (Ald, +--+ +h" In)" fx + th), 


into this equality in accordance with formula (8.59), we find what Theorem 4 as- 
serts. 


Remark If we write the remainder term in relation (8.62) in the Lagrange form 
rather than the integral form, then the equality 


|, (n-1) La) 
hoe ama A ay 7) Os? (8), 


(n—1)! 


where 0 < 6 < 1, implies Taylor’s formula (8.60) with remainder term 
1 
maGehy= — (hia +e +h mn)" fe +h). (8.63) 


This form of the remainder term, as in the case of functions of one variable, is 
called the Lagrange form of the remainder term in Taylor’ s formula. 
Since f € C™ (U(x); R), it follows from (8.63) that 


1 
Pn—1(3 A) = (hd +++ + Am)" FX) + 0( AI") as h > 0, 
and so we have the equality 
f(xttal,...,x™ +h”) — f(xl,.. x)= 
n 1 " 
= Do gt ate thn) f (x) + o( IA") as h > 0, (8.64) 
k=1 


called Taylor’s formula with the remainder term in Peano form. 


8.4.5 Extrema of Functions of Several Variables 


One of the most important applications of differential calculus is its use in finding 
extrema of functions. 


Definition 1 A function f : E > R defined ona set E C R” has a local maximum 
(resp. local minimum) at an interior point x9 of EF if there exists a neighborhood 
U(xo) C E of the point xp such that f(x) < f(xo) (resp. f(x) => f(xo)) for all 
x € U(x0). 

If the strict inequality f(x) < f(xo) holds for x € U(x) \ xo (or, respectively, 
f(x) > f(xo)), the function has a strict local maximum (resp. strict local minimum) 
at Xo. 
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Definition 2 The local minima and maxima of a function are called its local ex- 
trema. 


Theorem 5 Suppose a function f : U (xo) > R defined in a neighborhood U (xo) C 
R” of the point x9 = (ees ...,X9') has partial derivatives with respect to each of the 
variables x!,...,x"" at the point xo. 

Then a necessary condition for the function to have a local extremum at xo is that 
the following equalities hold at that point: 


a a 
oer i 7(%o)=0, ..., 2 eye (8.65) 
Xx x 


Proof Consider the function g(x!) = f Coes ...,Xg') Of one variable defined, 
according to the hypotheses of the theorem, in some neighborhood of the point an 
on the real line. At a the function g(x!) has a local extremum, and since 


it follows that AL (x0) =0. 
The other equalities in (8.65) are proved similarly. 


We call attention to the fact that relations (8.65) give only necessary but not 
sufficient conditions for an extremum of a function of several variables. An example 
that confirms this is any example constructed for this purpose for functions of one 
variable. Thus, where previously we spoke of the function x +> x*, whose derivative 
is zero at zero, but has no extremum there, we can now consider the function 


tae aa) = (x1)°, 


all of whose partial derivatives are zero at x9 = (0,..., 0), while the function obvi- 
ously has no extremum at that point. 

Theorem 5 shows that if the function f : G — R is defined on an open set 
G CR”, its local extrema are found either among the points at which f is not 
differentiable or at the points where the differential df (xo) or, what is the same, the 
tangent mapping f’(xo), vanishes. 

We know that if a mapping f : U(xo) > R” defined in a neighborhood U (xo) C 
R” of the point xo € R” is differentiable at xo, then the matrix of the tangent map- 
ping f’(xo) : R” — R” has the form 


a! f(x) ies am f (x0) 
: ey : (8.66) 
Oi f"(%o) +++ Om f" (x0) 
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Definition 3 The point xq is a critical point of the mapping f : U(xo) > R" if the 
rank of the Jacobi matrix (8.66) of the mapping at that point is less than min{m, n}, 
that is, smaller than the maximum possible value it can have. 

In particular, if nm = 1, the point xg is critical if condition (8.65) holds, that is, all 
the partial derivatives of the function f : U(xo) > R vanish. 

The critical points of real-valued functions are also called the stationary points 
of these functions. 

After the critical points of a function have been found by solving the system 
(8.65), the subsequent analysis to determine whether they are extrema or not can 
often be carried out using Taylor’s formula and the following sufficient conditions 
for the presence or absence of an extremum provided by that formula. 


Theorem 6 Let f : U(xo) > R be a function of class C®(U (xo); R) defined in a 
neighborhood U (x9) C R™ of the point x9 = igs .++,%9) € R”, and let xo be a 
critical point of the function f . 

If, in the Taylor expansion of the function at the point xo 


F(xo tal... +h") = 


: m Lyn Pf ipi 2 
=F lgecsnian ad aaa (xo)h'h! + o(||hII7) (8.67) 


the quadratic form 


Oe ve af 
De Fpiggy CMH = dij feoyh'h! (8.68) 
i,j °* O* 


a) is positive-definite or negative-definite, then the point xo has a local extremum 
at xo, which is a strict local minimum if the quadratic form (8.68) is positive-definite 
and a Strict local maximum if it is negative-definite; 

b) assumes both positive and negative values, then the function does not have an 
extremum at Xo. 


Proof Leth £0 and x9 +h € U(xo). Let us represent (8.67) in the form 


Lol ey of hi hi 
fo +h) — fo) = 5 Hall » aciaal (xo) Al Al + 2} (8.69) 
where o(1) is infinitesimal as h > 0. 

It is clear from (8.69) that the sign of the difference f(xp +h) — f (xq) is com- 
pletely determined by the sign of the quantity in brackets. We now undertake to 
study this quantity. 

The vector e = (h!/||h||,..., 4”/||h||) obviously has norm 1. The quadratic form 
(8.68) is continuous as a function h € R”, and therefore its restriction to the unit 
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sphere S(0; 1) = {x € R”™|||x|| = 1} is also continuous on $(0; 1). But the sphere S 
is a closed bounded subset in R”, that is, it is compact. Consequently, the form 
(8.68) has both a minimum point and a maximum point on S, at which it assumes 
respectively the values m and M. 

If the form (8.68) is positive-definite, then 0 < m < M, and there is a number 
6 > 0 such that |o(1)| < m for ||h|| <6. Then for ||h|| < 6 the bracket on the right- 
hand side of (8.69) is positive, and consequently f(x9 +h) — f(xo) > 0 for 0 < 
||A|| < 5. Thus, in this case the point x9 is a strict local minimum of the function. 

One can verify similarly that when the form (8.68) is negative-definite, the func- 
tion has a strict local maximum at the point x9. 

Thus a) is now proved. 

We now prove b). 

Let e,, and ey be points of the unit sphere at which the form (8.68) assumes the 
values m and M respectively, and letm <0 < M. 

Setting h = te,,, where ¢ is a sufficiently small positive number (so small that 
xo + tem € U(x0)), we find by (8.69) that 


I 2 
F (Xo + tem) — F(x0) = st (m + 0(1)), 


where o(1) > 0 as t — O. Starting at some time (that is, for all sufficiently small 
values of ft), the quantity m+ o(1) on the right-hand side of this equality will have 
the sign of m, that is, it will be negative. Consequently, the left-hand side will also 
be negative. 

Similarly, setting h = tey, we obtain 


1 4 
f(xo + tem) — f (x0) = ait (M + o0(1)), 


and consequently for all sufficiently small ¢ the difference f (xo + tey) — f (xo) is 
positive. 

Thus, if the quadratic form (8.68) assumes both positive and negative values on 
the unit sphere, or, what is obviously equivalent, in R”, then in any neighborhood 
of the point xo there are both points where the value of the function is larger than 
jf (xo) and points where the value is smaller than f (xo). Hence, in that case xg is 
not a local extremum of the function. 


We now make a number of remarks in connection with this theorem. 


Remark 1 Theorem 6 says nothing about the case when the form (8.68) is semi- 
definite, that is, nonpositive or nonnegative. It turns out that in this case the point 
may be an extremum, or it may not. This can be seen, in particular from the follow- 
ing example. 


Example 3 Let us find the extrema of the function f(x, y) = x*+ + y+ — 2x?, which 
is defined in R?. 
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In accordance with the necessary conditions (8.65) we write the system of equa- 
tions 
af 


—(x, y) =4x? — 4x =0, 
ox 


0 
BT: (3 ext 1, 
dy 


from which we find three critical points: (—1, 0), (0, 0), (1, 0). 


Since 
a f 2 ei 0° f 2 
—z (x, y) = 12x* —4, ,y)=0, ,y) = 12y", 
5x2 (x,y) x dxdy (x, y) dy2 (x,y) y 


at the three critical points the quadratic form (8.68) has respectively the form 
g(a)’, a(n’), a(n!) 


That is, in all cases it is positive semi-definite or negative semi-definite. Theorem 6 
is not applicable, but since f(x, y) = (x* — 1)? + y* — 1, it is obvious that the 
function f(x, y) has a strict minimum —1 (even a global minimum) at the points 
(—1, 0), and (1, 0), while there is no extremum at (0, 0), since for x = 0 and y £0, 
we have f(0, y) = y* > 0, and for y = 0 and sufficiently small x 4 0 we have 
f(x, 0) = x4 — 2x? <0. 


Remark 2 After the quadratic form (8.68) has been obtained, the study of its def- 
initeness can be carried out using the Sylvester’ criterion. We recall that by the 


Sylvester criterion, a quadratic form pe aj jxixd with symmetric matrix 


Qitoo+t* Aim 


Ami *** Gmm 


is positive-definite if and only if all its principal minors are positive; the form is 
negative-definite if and only if a1; < 0 and the sign of the principal minor reverses 
each time its order increases by one. 

Example 4 Let us find the extrema of the function 


f(x,y) =xyln(x? + y’), 


which is defined everywhere in the plane R? except at the origin. 


7J.J. Sylvester (1814-1897) — British mathematician. His best-known works were on algebra. 
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Solving the system of equations 


of 2 4 On y 
ay 9) = yIn(x dt ope 
of 4 2 2xy* 
Bo ee ty) + 7 =O 


we find all the critical points of the function 


1 1 1 1 
1): 1,0): eae eee) ae eee ee 
a ( J2e° =) ( J2e° =) 


Since the function is odd with respect to each of its arguments individually, the 
points (0, +1) and (+1, 0) are obviously not extrema of the function. 

It is also clear that this function does not change its value when the signs of both 
variables x and y are changed. Thus by studying only one of the remaining critical 
points, for example, (Fe Tm) we will be able to draw conclusions on the nature 


of the others. 


Since 

af ) 6xy Ax3y 

Saas = ’ 

ax2 5324 2 2+ y?? 

0° f Find 4x*y? 
andy (x, y) =In(x* +h*) +2 - Gaye 
a’ f 6xy 4xy? 

aa Oa 8 Fear gle? 

dy x*+y (x* + y*) 


at the point (Fe Tm) the quadratic form 9;; f (xo) hih/ has the matrix 


(3 2): 


that is, it is positive-definite, and consequently at that point the function has a local 
minimum 


1 1 1 
(= =) Qe" 
By the observations made above on the properties of this function, one can con- 
clude immediately that 


Fey) 


is also a local minimum and 


Tega) a va) 
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are local maxima of the function. This, however, could have been verified directly, 
by checking the definiteness of the corresponding quadratic form. For example, at 
the point (— Te Te the matrix of the quadratic form (8.68) has the form 


—2 O 
0 -2)? 
from which it is clear that it is negative-definite. 


Remark 3 It should be kept in mind that we have given necessary conditions (Theo- 
rem 5) and sufficient conditions (Theorem 6) for an extremum of a function only at 
an interior point of its domain of definition. Thus in seeking the absolute maximum 
or minimum of a function, it is necessary to examine the boundary points of the 
domain of definition along with the critical interior points, since the function may 
assume its maximal or minimal value at one of these boundary points. 

The general principles of studying noninterior extrema will be considered in 
more detail later (see the section devoted to extrema with constraint). It is useful to 
keep in mind that in searching for minima and maxima one may use certain simple 
considerations connected with the nature of the problem along with the formal tech- 
niques, and sometimes even instead of them. For example, if a differentiable func- 
tion being studied in R” must have a minimum because of the nature of the problem 
and turns out to be unbounded above, then if the function has only one critical point, 
one can assert without further investigation that that point is the minimum. 


Example 5 (Huygens’ problem) On the basis of the laws of conservation of energy 
and momentum of a closed mechanical system one can show by a simple computa- 
tion that when two perfectly elastic balls having mass m, and mg and initial veloc- 
ities vy and v2 collide, their velocities after a central collision (when the velocities 
are directed along the line joining the centers) are determined by the relations 


- (m, —m2)v; + 2m2v2 
U’u= , 
m, +m 


_ (m2 — mj)v2 + 2m, v4 
my +m , 


U2 


In particular, if a ball of mass M moving with velocity V strikes a motionless ball 
of mass m, then the velocity v acquired by the latter can be found from the formula 


2M 
v= V, (8.70) 
m+M 


from which one can see that if 0 < m < M, then V <v <2V. 

How can a significant part of the kinetic energy of a larger mass be communi- 
cated to a body of small mass? To do this, for example, one can insert balls with 
intermediate masses between the balls of small and large mass: m < im, < m2 < 

-< my, < M. Let us compute (after Huygens) how the masses m1,m2,...,™n 
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should be chosen to that the body m will acquire maximum velocity after succes- 
sive central collisions. 
In accordance with formula (8.70) we obtain the following expression for the 


required velocity as a function of the variables mj, m2,..., mn: 
m m m M 
v= a Po ieaey 7 Sgtly. (8.71) 
m+m, m+m2 Mp-1 +My M,+M 


Thus Huygens’ problem reduces to finding the maximum of the function 


m4 My M 
f(mi,...,M™n) = ———--::: . : 
m+m Myn-1 +m, m,+M 


The system of equations (8.65), which gives the necessary conditions for an in- 
terior extremum, reduces to the following system in the present case: 


m-mz—m;=0, 


my, +m3 — m3 =0, 


2 
Mn-1 _M-—m;, =0, 


from which it follows that the numbers m,m,...,m™n,,M form a geometric pro- 
gression with ratio g equal to "/M/m. 
The value of the velocity (8.71) that results from this choice of masses is given 


by 
2q n+1 
v= (4) V, (8.72) 
l+q 


which agrees with (8.70) ifn = 0. 

It is clear from physical considerations that formula (8.72) gives the maximal 
value of the function (8.71). However, this can also be verified formally (without 
invoking the cumbersome second derivatives. See Problem 9 at the end of this sec- 
tion). 

We remark that it is clear from (8.72) that if m — 0, then v > 2”+!V. Thus 
the intermediate masses do indeed significantly increase the portion of the kinetic 
energy of the mass M that is transmitted to the small mass m. 


8.4.6 Some Geometric Images Connected with Functions 
of Several Variables 


a. The Graph of a Function and Curvilinear Coordinates 


Let x, y, and z be Cartesian coordinates of a point in R? and let z= f(x, y) bea 
continuous function defined in some domain G of the plane R? of the variables x 
and y. 
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By the general definition of the graph of a function, the graph of the function 
f :G— R in our case is the set S = {(x, y,z) € R* | (x, y) € G,z= f(x, y)} in 
the space R?3. 


It is obvious that the mapping G a S defined by the relation (x, y) & 
(x, y, f(x, y)) is a continuous one-to-one mapping of G onto S, by which one 
can determine every point of S by exhibiting the point of G corresponding to it, or, 
what is the same, giving the coordinates (x, y) of this point of G. 

Thus the pairs of numbers (x, y) € G can be regarded as certain coordinates of 
the points of a set S — the graph of the function z = f (x, y). Since the points of S are 
given by pairs of numbers, we shall conditionally agree to call S a two-dimensional 
surface in R°. (The general definition of a surface will be given later.) 

If we define a path I”: J > G in G, then a path F o ’: I > S automatically 
appears on the surface S. If x = x(t) and y = y(f) is a parametric definition of 
the path I”, then the path F oI’ on S is given by the three functions x = x(t), 
y= y(t), z= z(t) = f(x(t), y(t)). In particular, if we set x = x9 + tf, y = Yo, we 
obtain a curve x = x9 +f, y= yo, Z= f(xo +f, yo) on the surface S along which 
the coordinate y = yo of the points of S does not change. Similarly one can exhibit a 
curve x = x9, y= yot t, z= f (Xo, Yo +4) on S along which the first coordinate xo 
of the points of S does not change. By analogy with the planar case these curves on 
S are naturally called coordinate lines on the surface S$. However, in contrast to the 
coordinate lines in G C R?, which are pieces of straight lines, the coordinate lines 
on S are in general curves in R?. For that reason, the coordinates (x, y) of points of 
the surface S are often called curvilinear coordinates on S. 

Thus the graph of a continuous function z = f(x, y), defined in a domain G C 
R? is a two-dimensional surface S in R* whose points can be defined by curvilinear 
coordinates (x, y) € G. 

At this point we shall not go into detail on the general definition of a surface, 
since we are interested only in a special case of a surface — the graph of a func- 
tion. However, we assume that from the course in analytic geometry the reader is 
well acquainted with some important particular surfaces in R? (such as a plane, an 
ellipsoid, paraboloids, and hyperboloids). 


b. The Tangent Plane to the Graph of a Function 


Differentiability of a function z= f(x, y) at the point (xo, yo) € G means that 


f(x, y) = f (xo, yo) + AC — x0) + B(y — yo) + 


+ o(y/@ — 20)? +09 — yo?) a8.) > ono), (8-73) 


where A and B are certain constants. 
In R? let us consider the plane 


Z=7z0 + A(x — x0) + Bly — yo), (8.74) 
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where zo = f (xo, yo). Comparing equalities (8.73) and (8.74), we see that the graph 
of the function is well approximated by the plane (8.74) in a neighborhood of the 
point (xo, yo, Zo). More precisely, the point (x, y, f(x, y)) of the graph of the func- 
tion differs from the point (x, y, z(x, y)) of the plane (8.74) by an amount that is 
infinitesimal in comparison with the magnitude a (x — xo)? + (y — yo)? of the dis- 
placement of its curvilinear coordinates (x, y) from the coordinates (xg, yo) of the 
point (x0, yo. Zo). 

By the uniqueness of the differential of a function, the plane (8.74) possessing 
this property is unique and has the form 


of of 
z= f (xo, yo) + 3x tO yo) (x — xo) + (Xo, yo)(y — yo). (8.75) 
x dy 


This plane is called the tangent plane to the graph of the function z = f (x, y) at the 
point (xo, yo, f (xo, yo)). 

Thus, the differentiability of a function z = f(x,y) at the point (x9, yo) 
and the existence of a tangent plane to the graph of this function at the point 
(xo, yo, f (Xo, yo)) are equivalent conditions. 


c. The Normal Vector 


Writing Eq. (8.75) for the tangent plane in the canonical form 
0 0 
LC, yo)(x — x0) + set, yo)(y — yo) — (2 — f (xo, yo)) = 0, 


we conclude that the vector 


0 0 
(Zo. jel tae: -1) (8.76) 
0 dy 


Xx 


is the normal vector to the tangent plane. Its direction is considered to be the direc- 
tion normal or orthogonal to the surface S (the graph of the function) at the point 
(xo, yo, f (x0, yo)). 

In particular, if (xo, yo) is a critical point of the function f(x, y), then the normal 
vector to the graph at the point (xo, yo, f (xo, yo)) has the form (0, 0, —1) and con- 
sequently, the tangent plane to the graph of the function at such a point is horizontal 
(parallel to the x y-plane). 

The three graphs in Fig. 8.1 illustrate what has just been said. 

Figures 8.la and c depict the location of the graph of a function with respect to 
the tangent plane in a neighborhood of a local extremum (minimum and maximum 
respectively), while Fig. 8.1b shows the graph in the neighborhood of a so-called 
saddle point. 
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z 


Fig. 8.1 


d. Tangent Planes and Tangent Vectors 


We know that if a path I : J — R? in R? is given by differentiable functions x = 
x(t), y = y(t), z = z(t), then the vector (x(0), y(0), z(0)) is the velocity vector at 
time ¢ = 0. It is a direction vector of the tangent at the point x9 = x(0), yo = y(0), 
z9 = 2(0) to the curve in R? that is the support of the path I’. 

Now let us consider a path I”: J — S on the graph of a function z= f(x, y) 
given in the form x = x(t), y= y(t), z= f(x(), y(t)). In this particular case we 
find that 


ay ae cont adae Tr ; af : 
(%(0), (0), 2(0)) = (so. ¥(0), Go, yo)*(0) + By OO v0)50)). 


from which it can be seen that this vector is orthogonal to the vector (8.76) nor- 
mal to the graph S of the function at the point (xo, yo, f (xo, yo)). Thus we have 
shown that if a vector (€, 7, €) is tangent to a curve on the surface S at the point 
(xo, yo, f (xo, Yo)), then it is orthogonal to the vector (8.76) and (in this sense) lies 
in the plane (8.75) tangent to the surface S at the point in question. More precisely 
we could say that the whole line x = x9 + €t, y= yo + nt, z= f (x0, yo) + ft lies 
in the tangent plane (8.75). 

Let us now show that the converse is also true, that is, if a line x = x9 + &t, 
y=yotnt,z= f (xo, yo) + ft, or what is the same, the vector (€, 7, ¢), lies in the 
plane (8.75), then there is a path on S for which the vector (&€, 7, €) is the velocity 
vector at the point (xo, yo, f (Xo, yo))- 

The path can be taken, for example, to be 


x=xot+ét, y=yo+nt, z= f(xo+ &t, yo+ nf). 


In fact, for this path, 
: , : 0 a 
x(0) =, y(O) =n, z(0) = L(x, yo)é + Ge yo): 
x dy 
In view of the equality 


rd 0 
aon yo)*(O) + oF yo) ¥(0) — z(0) =0 
Ox oy 
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Fig. 8.2 


(> 


and the hypothesis that 


0 0 
L(x, yo)§ + Ge yon —F =0. 
x dy 


We conclude that 


(%(0), 3(0), 2(0)) = &, n, £). 


Hence the tangent plane to the surface S at the point (xo, yo, zo) is formed by the 
vectors that are tangents at the point (xg, yo, Zo) to curves on the surface S passing 
through the point (see Fig. 8.2). 

This is a more geometric description of the tangent plane. In any case, one can 
see from it that if the tangent to a curve is invariantly defined (with respect to the 
choice of coordinates), then the tangent plane is also invariantly defined. 

We have been considering functions of two variables for the sake of visualiz- 
ability, but everything that was said obviously carries over to the general case of a 
function 


ice ae (8.77) 


of m variables, where m € N. 
At the point eae anngdg od ene ..+,X9')) the plane tangent to the graph of such 
a function can be written in the form 


a ; 
y= Fp. scg he) $y A cengty Je — x6); (8.78) 


the vector 


0 0 
(Fav. otto) =1) 


ne oxm 


is the normal vector to the plane (8.78). This plane itself, like the graph of the func- 
tion (8.77), has dimension m, that is, any point is now given by a set (x!,..., x") 
of m coordinates. 

Thus, Eq. (8.78) defines a hyperplane in R”*!. 

Repeating verbatim the reasoning above, one can verify that the tangent plane 
(8.78) consists of vectors that are tangent to curves passing through the point 
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Cas dQ f (a5: ...,X9')) and lying on the m-dimensional surface S — the graph 
of the function (8.77). 


8.4.7 Problems and Exercises 


1. Let z= f(x, y) bea function of class C“!)(G; R). 


a) ir’ By F(x, y) = 0 in G, can one assert that f is independent of y in G? 
b) Under what condition on the domain G does the preceding question have an 
affirmative answer? 
2. a) Verify that for the function 
Po te ge 
XVao 5, fx t+ 0, 
fa, ya) ue 
0, if x7 + y? =0, 


the following relations hold: 


of a f 
——(0,0)=14-1l= 
axdy ‘ ) - dvd 


b) Prove that if the function f(x, y) has partial derivatives 2 ay and 3 in some 


neighborhood U of the point (xo, yo), and if the mixed derivative i ae inh) 


exists in U and is continuous at (xo, yo), then the mixed derivative « £ (resp. “fy 
also exists at that point and the following equality holds: 


a2 2 
a f a f 
x0, . 
Ardy O20 = ayax 70" 70) 
3. Let x!,..., x” be Cartesian coordinates in R”. The differential operator 
m g2 
ve axi?’ 


2 
a F(x! eae) 


is called the Laplacian. 

The equation Af = 0 for the function f in the domain G C R” is called 
Laplace’s equation, and its solutions are called harmonic functions in the do- 
main G. 
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a) Show that if x = (x!,..., x”) and 


I|x || = 
then for m > 2 the function 
_2om 
f(x) = |x|] 2 
is harmonic in the domain R”\0, where 0 = (0,..., 0). 
b) Verify that the function 
1 Ilx r) 
1 m 
Merete SES + eX ‘ 
f( ) (2a./mt)™ o( 4at 
which is defined for t > 0 and x = (x!,..., x”) € R™, satisfies the heat equation 
af 9 
— =a*Af, 
ot eas 


‘ a4 
that is, verify that ar Say 5 at each point of the domain of definition of 
the function. 


4. Taylor’ s formula in multi-index notation. The symbol @ := (a1, ..., @m) consist- 
ing of nonnegative integers aj, i = 1,...,m, is called the multi-index a. 
The following notation is conventional: 


la] <= ay +--+ +am, 


a!:=ay!++-+Am); 


finally, if a = (a1,..., dm), then 


a? = ae ae 
a) Verify that if k € N, then 
k k} ay Om 
(a, +++: +m) ae 1 71 an > 
isk a) ‘Am 
or 
k! 
(a) +++++am)k = Y* — a", 

Diee a! 

where the summation extends over all sets a = (a, ..., @m) of nonnegative integers 


such that 77") aj =k. 
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b) Let 


alel f 
(dxl)@1 at (0x1 )%m 


Show that if f ¢ C“(G; R), then the equality 


D* f (x) := 


(x). 


a k! 
YO FOOKE HES a eS ayn”, 


ip te-t+im=k lja|=k 


where h = (h!, ..., A"), holds at any point x € G. 
c) Verify that in multi-index notation Taylor’s theorem with the Lagrange form 
of the remainder, for example, can be written as 


n—1 


1 1 
feth=> — Df (x)h® + > De F(x + Ohh. 


|o|=0 la|=n 


d) Write Taylor’s formula in multi-index notation with the integral form of the 
remainder (Theorem 4). 


5. a) Let 7” ={x =(«!,...,x”) ER” | |x!| <c!,i =1,...,m)} be an m-dimen- 
sional closed interval and J a closed interval [a, b] C R. Show that if the function 
fay=fo 1x, y) is defined and continuous on the set /” x J, then for any 


positive number ¢ > 0 there exists a number 6 > 0 such that | f(x, y1) — f(x, y2)| < 
eifx el”, y1, y2 €1, and |y — y2| <6. 
b) Show that the function 


b 
Fa)= [ f(x, y) dy 


is defined and continuous on the closed interval J”. 
c) Show that if f € CU; R), then the function 


F (x,t) = f(tx) 


is defined and continuous on J” x I', where I! = {t € R| |t| < 1}. 

d) Prove Hadamard’ s lemma: 

If f €CYU™;R) and f (0) = 0, there exist functions g1,...,%m € C(I; R) 
such that 


m 
Pe ae) =) ea et”) 
i=l 


in I™, and in addition 


af 


si =, 


(0), iml,...,m. 
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6. Prove the following generalization of Rolle’s theorem for functions of several 
variables. 

If the function f is continuous in a closed ball B(O;r), equal to zero on the 
boundary of the ball, and differentiable in the open ball B(O; r), then at least one of 
the points of the open ball is a critical point of the function. 

7. Verify that the function 


f(x,y) =(y —x7)(y — 3x7) 


does not have an extremum at the origin, even though its restriction to each line 
passing through the origin has a strict local minimum at that point. 

8. The method of least squares. This is one of the commonest methods of processing 
the results of observations. It consists of the following. Suppose it is known that the 
physical quantities x and y are linearly related: 


y=ax+b (8.79) 


or suppose an empirical formula of this type has been constructed on the basis of 
experimental data. 

Let us assume that n observations have been made, in each of which both x and y 
were measured, resulting in n pairs of values x1, y1;...; Xn, Yn. Since the measure- 
ments have errors, even if the relation (8.79) is exact, the equalities 


VR =axp +b 


may fail to hold for some of the values of k € {1,...,}, no matter what the coeffi- 
cients a and b are. 

The problem is to determine the unknown coefficients a and b in a reasonable 
way from these observational results. 

Basing his argument on analysis of the probability distribution of the magnitude 
of observational errors, Gauss established that the most probable values for the co- 
efficients a and b with a given set of observational results should be sought by use 
of the following least-squares principle: 

Tf 5x = (ax +b) — yx is the discrepancy in the kth observation, then a and b 
should be chosen so that the quantity 


A= 5782, 
k=1 


that is, the sum of the squares of the discrepancies, has a minimum. 
a) Show that the least-squares principle for relation (8.79) leads to the following 


system of linear equations 


[xxn, xe ]a + [xx, Lb =[xx, ye], 
[1,xJa +1, 1Jb=(1, ye], 
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Table 8.1 

Temperature, °C Frequency, % Temperature, °C Frequency, % 
0 39 20 136 
5 54 25 182 

10 74 30 254 

15 100 


for determining the coefficients a and b. Here, following Gauss, we write [xx, x4] := 
XyXy tess + XnXn, (Xe, Y= x1 - 14+ --+ + xn- 1, ee, ye] = x1 y1 +++ + Xn yn, and 
so forth. 

b) Write the system of equations for the numbers aj,..., 4 ,,b to which the 
least-squares principle leads when Eq. (8.79) is replaced by the relation 


m 
y= = a;x! +b, 
i=1 
(or, more briefly, y = a;x! +b) between the quantities x',...,x’" and y. 
c) How can the method of least squares be used to find empirical formulas of 
the form 


YS Oxy! +++," 
connecting physical quantities x,,..., x With the quantity y? 

d) (M. Germain) The frequency R of heart contractions was measured at differ- 
ent temperatures T in several dozen specimens of Nereis diversicolor. The frequen- 
cies were expressed in percents relative to the contraction frequency at 15 °C. The 
results are given in Table 8.1. 

The dependence of R on T appears to be exponential. Assuming R = Ae?’, find 
the values of the constants A and b that best fit the experimental results. 


9. a) Show that in Huygens’ problem, studied in Example 5, the function (8.71) 
tends to zero if at least one of the variables m1, ..., m, tends to infinity. 

b) Show that the function (8.71) has a maximum point in R” and hence the 
unique critical point of that function in R” must be its maximum. 

c) Show that the quantity v defined by formula (8.72) is monotonically increas- 
ing as n increases and find its limit as n > oo. 


10. a) During so-called exterior disk grinding the grinding tool — a rapidly rotating 
grinding disk (with an abrasive rim) that acts as a file — is brought into contact with 
the surface of a circular machine part that is rotating slowly compared with the disk 
(see Fig. 8.3). 

The disk K is gradually pressed against the machine part D, causing a layer H of 
metal to be removed, reducing the part to the required size and producing a smooth 
working surface for the device. In the machine where it will be placed this surface 
will usually be a working surface. In order to extend its working life, the metal of the 
machine part is subjected to a preliminary annealing to harden the steel. However, 
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Fig. 8.3 


because of the high temperature in the contact zone between the machine part and 
the grinding disk, structural changes can (and frequently do) occur in a certain layer 
A of metal in the machine part, resulting in decreased hardness of the steel in that 
layer. The quantity A is a monotonic function of the rate s at which the disk is 
applied to the machine part, that is, A = g(s). It is known that there is a certain 
critical rate sg > O at which the relation A = 0 still holds, while A > 0 whenever 
5 > So. For the following discussion it is convenient to introduce the relation 


s=y(A) 


inverse to the one just given. This new relation is defined for A > 0. 

Here yw is a monotonically increasing function known experimentally, defined 
for A > 0, and w(0) = s9 > 0. 

The grinding process must be carried out in such a way that there are no structural 
changes in the metal on the surface eventually produced. 

In terms of rapidity, the optimal grinding mode under these conditions would 
obviously be a set of variations in the rate s of application of the grinding disk for 
which 


s=y(6), 


where 6 = 6(ft) is the thickness of the layer of metal not yet removed up to time f, or, 
what is the same, the distance from the rim of the disk at time f to the final surface 
of the device being produced. Explain this. 

b) Find the time needed to remove a layer of thickness H when the rate of ap- 
plication of the disk is optimally adjusted. 

c) Find the dependence s = s(t) of the rate of application of the disk on time 


in the optimal mode under the condition that the function A ees s is linear: s = 
SOTAA. 

Due to the structural properties of certain kinds of grinding lathes, the rate s can 
undergo only discrete changes. This poses the problem of optimizing the produc- 
tivity of the process under the additional condition that only a fixed number n of 
switches in the rate s are allowed. The answers to the following questions give a 
picture of the optimal mode. 
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d) What is the geometric interpretation of the grinding time t(H) = ns aa 
that you found in part b) for the optimal continuous variation of the rate s? 

e) What is the geometric interpretation of the time lost in switching from the 
optimal continuous mode of variation of s to the time-optimal stepwise mode of 
variation of s? 

f) Show that the points 0 = x)41 <x, <--: <x, <xo =A of the closed inter- 
val [0, H] at which the rate should be switched must satisfy the conditions 


1 1 1\/ 
= DQG — Xi- i=1,..., 
Wiig) Wi) (=) (xi) (xi —xi-1) @ n) 


and consequently, on the portion from x; to xj+1, the rate of application of the disk 
has the form s = W(4j+41) @ =0,...,7). 

g) Show that in the linear case, when w(A) = so + AA, the points x; (in part f)) 
on the closed interval [0, H] are distributed so that the numbers 


form a geometric progression. 


11. a) Verify that the tangent to a curve I": J > R” is defined invariantly relative 
to the choice of coordinate system in R”. 

b) Verify that the tangent plane to the graph S of a function y = f(x!,..., x”) 
is defined invariantly relative to the choice of coordinate system in R”. 

c) Suppose the set S CR” x R! is the graph of a function y = f (x!,..., x”) 
in coordinates (x!,...,x’", y) in R” x R! and the graph of a function j = 
Fo, ...,’”) in coordinates (x!,...,¥’”, ¥) in R” x R!. Verify that the tangent 
plane to S is invariant relative to a linear change of coordinates in R’” x R!. 

d) Verify that the Laplacian Af = 7", Fh ox) is defined invariantly relative 
to orthogonal coordinate transformations in R”. 


8.5 The Implicit Function Theorem 


8.5.1 Statement of the Problem and Preliminary Considerations 


In this section we shall prove the implicit function theorem, which is important both 
intrinsically and because of its numerous applications. 

Let us begin by explaining the problem. Suppose, for example, we have the rela- 
tion 


Pay = 10 (8.80) 


between the coordinates x, y of points in the plane R?. The set of all points of R? 
satisfying this condition is the unit circle (Fig. 8.4). 
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Fig. 8.4 7] 
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The presence of the relation (8.80) shows that after fixing one of the coordinates, 
for example, x, we can no longer choose the second coordinate arbitrarily. Thus 
relation (8.80) determines the dependence of y on x. We are interested in the ques- 
tion of the conditions under which the implicit relation (8.80) can be solved as an 
explicit functional dependence y = y(x). 

Solving Eq. (8.80) with respect to y, we find that 


= +1 — x2, (8.81) 


that is, to each value of x such that |x| < 1, there are actually two admissible values 
of y. In forming a functional relation y = y(x) satisfying relation (8.80) one cannot 
give preference to either of the values (8.81) without invoking additional require- 
ments. For example, the function y(x) that assumes the value +1 — x? at rational 
points of the closed interval [—1, 1] and the value —/1 — x? at irrational points 
obviously satisfies (8.80). 

It is clear that one can create infinitely many functional relations satisfying (8.80) 
by varying this example. 

The question whether the set defined in R? by (8.80) is the graph of a function 
y = y(x) obviously has a negative answer, since from the geometric point of view 
it is equivalent to the question whether it is possible to establish a one-to-one direct 
projection of a circle into a line. 

But observation (see Fig. 8.4) suggests that nevertheless, in a neighborhood of a 
particular point (x9, yo) the arc projects in a one-to-one manner into the x-axis, and 
that it can be represented uniquely as y = y(x), where x +» y(x) is a continuous 
function defined in a neighborhood of the point x9 and assuming the value yo at xo. 
In this aspect, the only bad points are (—1, 0) and (1, 0), since no arc of the circle 
having them as interior points projects in a one-to-one manner into the x-axis. Even 
so, neighborhoods of these points on the circle are well situated relative to the y- 
axis, and can be represented as the graph of a function x = x(y) that is continuous 
in a neighborhood of the point 0 and assumes the value —1 or 1 according as the arc 
in question contains the point (—1, 0) or (1, 0). 

How is it possible to find out analytically when a geometric locus of points de- 
fined by a relation of the type (8.80) can be represented in the form of an explicit 
function y = y(x) or x = x(y) in a neighborhood of a point (xo, yo) on the locus? 
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We shall discuss this question using the following, now familiar, method. We 
have a function F(x, y) = 4 y? — 1. The local behavior of this function in a 
neighborhood of a point (xo, yo) is well described by its differential 


F,.(xo, yo) (x — x0) + Fy (x0, yo)(y — yo), 


since 


F(x, y) = F(xo, yo) + Fy (x0, yo)(x — x0) + 
+ Fy (x0, yo)(y — yo) + o(|x — xol + ly — yol) 


as (x, y) > (XQ, yo). 
If F (xo, yo) = 0 and we are interested in the behavior of the level curve 


F(x, y)=0 


of the function in a neighborhood of the point (x9, yo), we can judge that behavior 
from the position of the (tangent) line 


F,,(x0, Yo) (x — x0) + Fy (x0, yo)(y — yo) = 0. (8.82) 


If this line is situated so that its equation can be solved with respect to y, then, 
since the curve F(x, y) = 0 differs very little from this line in a neighborhood of 
the point (xo, yo), we may hope that it also can be written in the form y = y(x) in 
some neighborhood of the point (xo, yo). 

The same can be said about local solvability of F(x, y) = 0 with respect to x. 

Writing Eq. (8.82) for the specific relation (8.80), we obtain the following equa- 
tion for the tangent line: 


xo(x — xo) + yo(y — yo) = 0. 


This equation can always be solved for y when yo ¥ 0, that is, at all points of the 
circle (8.80) except (—1, 0) and (1, 0). It is solvable with respect to x at all points 
of the circle except (0, —1) and (0, 1). 


8.5.2 An Elementary Version of the Implicit Function Theorem 


In this section we shall obtain the implicit function theorem by a very intuitive, but 
not very constructive method, one that is adapted only to the case of real-valued 
functions of real variables. The reader can become familiar with another method 
of obtaining this theorem, one that is in many ways preferable, and with a more 
detailed analysis of its structure in Chap. 10 (Part 2), and also in Problem 4 at the 
end of the section. 

The following proposition is an elementary version of the implicit function the- 
orem. 
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Proposition 1 Jf the function F : U(xo, yo) ~ R defined in a neighborhood 
U (x0, yo) of the point (xo, yo) € R? is such that 


19 F €C”)(U;R), where p> 1, 
2° F(xo, yo) =0, 
3° Fi (x0, yo) #0, 


then there exist a two-dimensional interval I = I, x I, where 
I, ={xeER||x—xol<a},  I,={yeR|ly—yol <8}, 


that is a neighborhood of the point (xo, yo) contained in U(x, yo), and a function 
f €C) (Ly; Iy) such that 


FQ, y)=0oy=f(), (8.83) 


for any point (x,y) € I; x Ty and the derivative of the function y = f (x) at the 
points x € I, can be computed from the formula 


f'@=—[Fi(x, f@)] [F(x £@))- (8.84) 


Before taking up the proof, we shall give several possible reformulations of the 
conclusion (8.83), which should bring out the meaning of the relation itself. 

Proposition | says that under hypotheses 1°, 2°, and 3° the portion of the set 
defined by the relation F(x, y) = 0 that belongs to the neighborhood I, x Jy of the 
point (xo, yo) is the graph of a function f : I, > I, of class CHOU; Ty). 

In other words, one can say that inside the neighborhood J of the point (xo, yo) 
the equation F(x, y) = 0 has a unique solution for y, and the function y = f(x) is 
that solution, that is, F(x, f(x)) =0 on i,. 

It follows in turn from this that if y = a (x) 1s a function defined on J, that is 
known to satisfy the relation F(x, i (x))=O0on i, f (xo) = yo, and this function is 
continuous at the point xo € J,, then there exists a neighborhood A C J, of x9 such 
that f(A) C Jy, and then f(x) = f(x) for x € A. 

Without the assumption that the function f is continuous at the point xo and the 
condition 7 (xo) = yo, this last conclusion could turn out to be incorrect, as can be 
seen from the example of the circle already studied. 

Let us now prove Proposition |. 


Proof Suppose for definiteness that F’ (xo, yo) > 0. Since F € C(U;R), it fol- 
lows that F/(x, y) > 0 also in some neighborhood of (x9, yo). In order to avoid in- 
troducing new notation, we can assume without loss of generality that F > (x,y) >0 
at every point of the original neighborhood U (xo, yo). 

Moreover, shrinking the neighborhood U (xo, yo) if necessary, we can assume 
that it is a disk of radius r = 26 > 0 with center at (x9, yo). 

Since F 7 (x, y) > Oin U, the function F(xo, y) is defined and monotonically in- 
creasing as a function of y on the closed interval yo — 8B < y < yo + 8. Consequently, 


F (xo, yo — B) < F (x0, yo) = 0 < F (xo, yo + B). 
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By the continuity of the function F in U, there exists a positive number a < 6 such 
that the relations 


F(x, yo— B) <0 < F(x, yo + B) 


hold for |x — x9| <a. 
We shall now show that the rectangle J = I, x Jy, where 


Ip ={x ER| |x — x0] <a}, Iy={y eR | ly—yol < B}, 


is the required two-dimensional interval in which relation (8.83) holds. 

For each x € J, we fix the vertical closed interval with endpoints (x, yo — 6), 
(x, yo + B). Regarding F(x, y) as a function of y on that closed interval, we ob- 
tain a strictly increasing continuous function that assumes values of opposite sign 
at the endpoints of the interval. Consequently, for each x € /,, there is a unique 
point y(x) € Jy such that F(x, y(x)) = 0. Setting y(x) = f(x), we arrive at relation 
(8.83). 

We now establish that f € CP: Ty). 

We begin by showing that the function f is continuous at x and that f (xo) = yo. 
This last equality obviously follows from the fact that for x = xo there is a unique 
point y(xo) € Jy such that F(xo, y(xo)) = 0. At the same time, F'(x9, yo) = 0, and 
so f (xo) = yo- 

Given a number e, 0 < € < f, we can repeat the proof of the existence of the 
function f(x) and find a number 6, 0 < 6 <a such that in the two-dimensional 
interval [ = E Xx I y, where 


p= {xeER||x—x0l<5}, 4 ={yeR|ly—yol <e}, 


the relation 
(F(x, y)=0in 1) (y= f(a), x € hy) (8.85) 


holds with a new function 5 : L, = 7, ye 

But fy C Ty, Ly— C Iy, and I C I, and therefore it follows from (8.83) and (8.85) 
that f(x) = f(x) forxe i Cc I,. We have thus verified that | f(x) — f(xo)| = 
| f(x) — yo| < e for |x — xo| <6. 

We have now established that the function f is continuous at the point xo. But 
any point (x, y) € J at which F(x, y) = 0 can also be taken as the initial point of 
the construction, since conditions 2° and 3° hold at that point. Carrying out that 
construction inside the interval 7, we would once again arrive via (8.83) at the cor- 
responding part of the function f considered in a neighborhood of x. Hence the 
function f is continuous at x. Thus we have established that f ¢ C(U,; Jy). 

We shall now show that f €C Oe: I,) and establish formula (8.84). 

Let the number Ax be such that x + Ax € [,. Let y= f(x) and y+ Ay = 
f(x + Ax). Applying the mean-value theorem to the function F(x, y) inside the 
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interval J, we find that 
O= F(x+ Ax, f(x + Ax)) — F(x, f@)) = 
= F(x+Ax,y+ Ay) -— F(x, y)= 
= Fi(x+6Ax, y+OAy)Ax + Fy (x+6Ax,y+0Ay)Ay (0<6<1), 


from which, taking account of the relation F’ : (x, y) £0 in J, we obtain 


Ay Fi(x+@Ax, y + 0Ay) 
Ax Fi(x+0Ax,y+0Ay)’ 


(8.86) 


Since f € CU; Ly), it follows that Ay + 0 as Ax — 0, and, taking account of 
the relation F € C)(U; R), as Ax > 0 in (8.86), we obtain 


F(x, y) 


fo=-Fe 


where y = f (x). Thus formula (8.84) is now established. 

By the theorem on continuity of composite functions, it follows from formula 
(8.84) that f e CY C,; Jy). 

If F ¢ C~(U;R), the right-hand side of formula (8.84) can be differentiated 
with respect to x, and we find that 


[Fu + ee sae (x)] Le Fey ey -f' Cl 


(a (8.84) 


f"@)= 


where FY. i ye ey EG ae and Fy , are all computed at the point (x, f(x)). 

Thus f € COL: Ty) if F € COW: IR). Since the order of the derivatives of 
f on the right-hand side of (8.84), (8.84’), and so forth, is one less than the order 
on the left-hand side of the equality, we find by induction that f € C?) (Iy; 1 y) if 


FeC”(U,R). 


Example I Let us return to relation (8.80) studied above, which defines a circle 
in R?, and verify Proposition 1 on this example. 
In this case 
F(x, y)=x*+y’?- 1, 
and it is obvious that F € C‘°) (R?; R). Next, 


E@ye2s,  Fo,y)=2y, 


so that F) ‘(x, y) £0 if y £0. Thus, by Proposition 1, for any point (xo, yo) of this 
circle different from the points (—1, 0) and (1, 0) there is a neighborhood such that 
the arc of the circle contained in that neighborhood can be written in the form y = 


f (x). Direct computation confirms this, and f(x) = V1 — x? or f(x) = —V 1 — x?. 
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Next, by Proposition 1, 


F(x, Yo) Xo 


a= Fi(xo, Yo) Yo. 


(8.87) 


Direct computation yields 


sks, if f@)=VvI-2, 


’ 
=a2 


sit f@) =-VI-2#, 


which can be written as the single expression 


f= 


x 


we oi 
f(x) y 


and computation with it leads to the same result, 


f@)=—- 


’ 


f' (xo) = == 
yO 


as computation from formula (8.87) obtained from Proposition 1. 

It is important to note that formula (8.84) or (8.87) makes it possible to compute 
f' (x) without even having an explicit expression for the relation y = f(x), if only 
we know that f(x9) = yo. The condition yo = f (xo) must be prescribed, however, 
in order to distinguish the portion of the level curve F(x, y) = 0 that we intend to 
describe in the form y = f(x). 

It is clear from the example of the circle that giving only the coordinate xo does 
not determine an arc of the circle, and only after fixing yo have we distinguished 
one of the two possible arcs in this case. 


8.5.3 Transition to the Case of a Relation F(x!,...,x™, y) =0 


The following proposition is a simple generalization of Proposition | to the case of 
arelation F(x!,...,x”, y)=0. 


Proposition 2 If a function F : U > R defined in a neighborhood U C R"*! of 
the point (xo, yo) = Cas viesXy 130) = R™+! is such that 


19 FeC(U;R), p> 1, 
2” F(X, 9) SP Ogracca dg vO =0; 
3° Fa, yo) = FG + ep. 0) FO, 


then there exists an (m + 1)-dimensional interval I = 1!" x Te where 


1” = {x =(z',...,2") eR” | |x’ — xp| <a’, P= Tj sagt, 
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I, ={y €R||y—yol < B}, 


which is a neighborhood of the point (xo, yo) contained in U, and a function f € 
CHU; I}) such that for any point (x, y) € I" x i 


Fe acek™ SOS 9S FO peek”), (8.88) 
and the partial derivatives of the function y € f (x!,...,x'") at the points of Ix can 
be computed from the formula 

Foy = [Ri (x, FONT L(x, FOO] (8.89) 

af = v(x, £ (x, f(x))]- ; 


Proof The proof of the existence of the interval 7+! = 1” x I} and the existence 
of the function y = f(x) = f(x!,...,x”) and its continuity in I’” is a verbatim 
repetition of the corresponding part of the proof of Proposition 1, with only a single 
change, which reduces to the fact that the symbol x must now be interpreted as 
(x!,...,x”) anda as (a!,..., a’). 

If we now fix all the variables in the functions F(x!,...,x”, y) and Fx", sad 
x’) except x! and y, we have the hypotheses of Proposition 1, where now the role of 
x is played by the variable x’. Formula (8.89) follows from this. It is clear from this 
formula that ia ec"; 1;) (i =1,...,m), that is, f € Cou q: Reasoning 
as in the proof of Proposition 1, we establish by induction that f € C”) (I ee t) 
when F € C)(U; R). 


Example 2 Assume that the function F : G > R is defined in a domain G C 
IR” and belongs to the class C‘(G; R); x9 = Core a € G and F(x) = 
F ans eee x5) = 0. If xo is not a critical point of F, then at least one of the par- 


tial derivatives of F at xo is nonzero. Suppose, for example, that ao (xo) £0. 
Then, by Proposition 2, in some neighborhood of xo the subset of R” defined 
by the equation F(x!,...,x’”) = 0 can be defined as the graph of a function 
x” = f(x!,...,x7!), defined in a neighborhood of the point ‘cr a aa) € 
R”-! that is continuously differentiable in this neighborhood and such that 
‘oe es a) = 
Thus, in a neighborhood of a noncritical point x9 of F the equation 


Pe ii) =0 
defines an (m — 1)-dimensional surface. 
In particular, in the case of R? the equation 
F(x, y,z)=0 


defines a two-dimensional surface in a neighborhood of a noncritical point 
(xo, Yo, Zo) Satisfying the equation, which, when the condition oF (x0, yo, Zo) #0 
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holds, can be locally written in the form 


z= f,y). 


As we know, the equation of the plane tangent to the graph of this function at the 
point (x0, yo, Zo) has the form 


0 0 
Z-Z= ae. yo)(x — xo) + oe, yo)(y — yo). 
Ox oy 


But by formula (8.89) 


F (Xo, Yo, Z0) af F (x0, Yo, Zo) 
’ X0, Yo) = Sa 
F(x0, yo, Zo) dy F (x0, Yo, Z0) 


of 
=~ (X0, Yo) = 
Ox 

and therefore the equation of the tangent plane can be rewritten as 


F (x0, Yo. 20) (x — x0) + F;,(x0, yo, 20)(¥ — yo) + Fz (xo, yo, 20)(z — zo) = 0 


which is symmetric in the variables x, y, z. 
Similarly, in the general case we obtain the equation 


De i (x0)( x ‘x)= 0 


of the hyperplane in R” tangent at the point x9 = cae ..-,X9') to the surface given 
by the equation F (x Mooijx™)=0 (naturally, under the assumptions that F (xo) = 0 
and that xo is a noncritical point of F). 

It can be seen from these equations that, given the Euclidean structure on R”, 
one can assert that the vector 


OF OF 
grad F (xo) = aylt'''? Byam (xo) 


is orthogonal to the r-level surface F(x) =r of the function F at a corresponding 
point x9 € R”. 
For example, for the function 


No 
NO 
nN 


x ae z 
F(x, y,z)= an He + 2? 
defined in R?, the r-level is the empty set if r < 0, a single point if r = 0, and the 
ellipsoid 
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if r > 0. If (xo, yo, Zo) 18 a point on this ellipsoid, then by what has been proved, the 
vector 


2x0 2yo0 2z0 
grad F'(x0, yo, Zo) = (=. pe? 2) 


is orthogonal to this ellipsoid at the point (xo, yo, Zo), and the tangent plane to it at 
this point has the equation 


xo(x— x0) | yo(y— yo) , Z0(Z — 20) 
a2 + b2 " c2 =O 


which, when we take account of the fact that the point (xo, yo, Zo) lies on the ellip- 
soid, can be rewritten as 


Xox YOY | 20% 


ae ae 


8.5.4 The Implicit Function Theorem 


We now turn to the general case of a system of equations 


eG kW can SU; 


: (8.90) 
FOP acid sag”) =0, 
which we shall solve with respect to y!,..., y”, that is, find a system of functional 
relations 
yl= lice : ee a 
(8.91) 


y” = ro = on”); 
locally equivalent to the system (8.90). 

For the sake of brevity, convenience in writing, and clarity of statement, let us 
agree that x = (x!,...,x”"), y=(y!,..., y”). We shall write the left-hand side of 
the system (8.90) as F(x, y), the system of equations (8.90) as F(x, y) = 0, and the 
mapping (8.91) as y= f(x). 


If 
t= igaeesa W= (aid), 
(a 230"), BHP .05:4 8"); 
the notation |x —xo| < @ or |y—yo| < 6 will mean that |x! — xi] <a’ (i=1,...,m) 


or |y/ — yp | < B/ (j =1,...,n) respectively. 
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We next set 
af! af! 
Ox! ox™ 
fos] = % & PG); (8.92) 
af” af” 
ax! ax™ 
oF! oF! 
axl ox'™ 
Fas] > + 1,9), (8.93) 
oF” oF” 
Ox ox” 
art oF! 
ay ay" 
Fi, y)= : — : (x,y). (8.94) 
aF” aF” 
dy ay" 


We remark that the matrix F(x, y) is square and hence invertible if and only if 
its determinant is nonzero. In the case n = 1, it reduces to a single element, and in 
that case the invertibility of F. i“ (x, y) is equivalent to the condition that that single 
element is nonzero. As usual, we shall denote the matrix inverse to FY (x, y) by 
[Fi(x, I. 

We now state the main result of the present section. 


Theorem 1 (Implicit function theorem) /[f the mapping F : U —> R" defined in a 
neighborhood U of the point (xo, yo) € R™*” is such that 

19 Fec”)(U;R"), p= 1, 

2° F(xo, yo) =0, 

3° F 5 (x9, yo) is an invertible matrix, 


then there exists an (m + n)-dimensional interval I = I)" x I? C U, where 


1” = {x ER” | |x — xo] <a}, In ={y eR" | ly —yol < B}, 


and a mapping f € C(I"; Ty) such that 


Fa, y)=0e y= fF), (8.95) 


for any point (x, y) € IX" x I; and 


fa) =—[Fi(x, f@)] LF. f@)]. (8.96) 


Proof The proof of the theorem will rely on Proposition 2 and the elementary prop- 
erties of determinants. We shall break it into stages, reasoning by induction. 
For n = 1, the theorem is the same as Proposition 2 and is therefore true. 
Suppose the theorem is true for dimension n — 1. We shall show that it is then 
valid for dimension n. 
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a) By hypothesis 3°, the determinant of the matrix (8.94) is nonzero at the point 
(xo, yo) € R™*” and hence in some neighborhood of the point (xo, yo). Conse- 
quently at least one element of the last row of this matrix is nonzero. Up to a change 
in the notation, we may assume that the element ot i is nonzero. 

b) Then applying Proposition 2 to the relation ~ 


a ot tig m0, 


we find an interval /’”"+” = ie x ey x ee Cc U and a function 7 € Cwm x 
ier I) such that 


a (y= F(x"; Fie ee at 7s 
Cape ad) Mr ae ali ea (8.97) 
c) Substituting the resulting expression y” = f(x, y!,..., y’—!) for the variable 


y” in the first n — 1 equations of (8.90), we obtain n — | relations 


Db! (x! on a og! bate y= 
SE ii a ee I ee ee SO 


OO asi ees) = 


SET eee ey pee I ee ay Se 


O (tbs sep MN Vasey SO (i,...,n—1), 


Since F Ghia Agee Ie ) = hand F'Ga yo) 00 = 1, 0...) 
By definition of the functions @* (k=1,...,n—1), 
ape =oaFk ak af 


= oe Male), 8.99 
oy! oy! 7 dy” dy! (i 7 ) ( ) 


Further setting 
Oo" (x', eee Vis aoa) = 
— ae it oe ane fe Singh UY aias y"!)) 
we find by (8.97) that 6” = 0 in its domain of definition, and therefore 


ab" AF" OF" af | 
ay! = dy! ay" ay! ~ 


Oo STs oct Thy (8.100) 
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Taking account of relations (8.99) and (8.100) and the properties of determinants, 
we can now observe that the determinant of the matrix (8.94) equals the determinant 
of the matrix 


aF! , oF! af |. ar! | aF! af aF! 
ay! oy” ay! ayn! oy” ayn-l ay” 
ar", oF" af |. ar" , aF™ af arn 
ay! ay” ay! ay"! ay” ayr—l ay” 

amg!!! aF! 

ay! ay? dy" 

7 apr! _ apr! oFn-! 

ay! ayn! ay” 

ar” 

0 ea 0 a 

oF” 


By assumption, 35>, # 0, and the determinant of the matrix (8.94) is nonzero. 


Consequently, in some neighborhood of Gd: ee XO's Vi ona oe) the determinant 


of the matrix 


ag! ag! 
dy! — ayn-l 
‘ : 1 m .1 n—1 
en re ee re a 
ag"! ag"! 
ay! noe ayn-l 


is nonzero. 
Then by the induction hypothesis there exist an interval /’”"+"~! = 1” x I a E 


Te x a which is a neighborhood of Cie Jina Xhs os sian) in R”—!, and 
a mapping f ¢ C(I"; rly such that the system (8.98) is equivalent on the 
interval ["*"—! — [™ x J aa to the relations 


vel. (8.101) 
Tae als . =) 


d) Since 1?-! c 1-1, and 1” C 1, substituting f!,..., f”~! from (8.101) in 
place of the corresponding variables in the function 


SF a ete a) 


from (8.97) we obtain a relation 
PF age) (8.102) 


between y” and (x!,...,x”). 
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e) We now show that the system 


y! — fad 6 eee ad 
: poe =e hig (8.103) 
y= Pe ek 


which defines a mapping f ¢ C) (1; Ty), where Ty = Iy~ I 5 ye is equivalent to 
the system of equations (8.90) in the neighborhood J jn =1" x Bis : 

In fact, inside ("+" = (J hg x1 oe Me I : we began by replacing ne last equation 
of the original system (8.90) with the equality y” = f(x, y!,..., y"~!), which is 
equivalent to it by virtue of (8.97). From the second system so obtained, we passed 
to a third system equivalent to it by replacing the variable y” in the first n — 1 
equations with f(x, y!,..., y?~!). We then replaced the first n — 1 equations (8.98) 
of the third system inside 17” x 1?~! c 1” x [?~! with relations (8.101), which are 
equivalent to them. In that way, we obtained a fourth system, after which we passed 


to the final system (8.103), ae is equivalent to it inside [" x [7 ly i, late, 
n—1 


by rep lacane the variables y a y with their expressions (8. 101) in the faci 
equation y” = f(x!,...,x”, m yl ,..., ¥"7!) of the fourth system, obtaining (8.102) 
as the last Saiaren:,. 
f) To complete the proof of the theorem it remains only to verify formula (8.96). 
Since the systems (8.90) and (8.91) are equivalent in the neighborhood J(" x I?’ 
of the point (xo, yo), it follows that : 


F(x, f(x))=0, ifxe ly”. 


In coordinates this means that in the domain /(” 


PR ied OF a tia lied (hee HO (k=1,...,n). 
(8.104) 
Since f ¢ C(I"; I") and F € C'?)(U; R”), where p > 1, it follows that 


Fis fC) eemd ”; IR”) and, differentiating the identity (8.104), we obtain 


aFe Svar* afi 
rey aire (eal, gneiat, om). (8.105) 


= dyF Oxi 
Relations (8.105) are obviously equivalent to the single matrix equality 


Fix, y)+ Fi(x, y)- f'@) =0 


in which y = f(x). 
Taking account of the invertibility of the matrix E (x, y) in a neighborhood of 
the point (xo, yo), we find by this equality that 


f'@=-[Fi(x f@)]) [F(x £@)], 


and the theorem is completely proved. 
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8.5.5 Problems and Exercises 


1. On the plane R* with coordinates x and y a curve is defined by the relation 
F(x, y) =0, where F € C?)(IR?, R). Let (xo, yo) be a noncritical point of the func- 
tion F(x, y) lying on the curve. 


a) Write the equation of the tangent to this curve at this point (x0, yo). 
b) Show that if (xo, yo) is a point of inflection of the curve, then the following 
equality holds: 
(Fu, Fy — 2Fi, FF, + Fy, F2) (xo, yo) =0 


xy" Xx yy" x 
c) Find a formula for the curvature of the curve at the point (xo, yo). 


2. The Legendre transform in m variables. The Legendre transform of x!,...,x'" 


and the function f (x Dee x’) is the transformation to the new variables £1, ..., E 
and function f*(&,..., &) defined by the relations 


& = (x ensee) (i =1,...,m), 


il (8.106) 
Pisce =) oe HP A peak) 


a) Give a geometric interpretation of the Legendre transform (8.106) as the tran- 
sition from the coordinates (x!,..., x”, f (x!,...,x)) of a point on the graph of 
the function f (x) to the parameters (E),...,&m, f*(E1, ---, &m)) defining the equa- 
tion of the plane tangent to the graph at that point. 

b) Show that the Legendre transform is guaranteed to be possible locally if f € 


C® and det(; ay) 


x! axd 
c) Using ne same definition of convexity for a function f(x) = f (xt, oo. x”) 
as in the one-dimensional case (taking x to be the vector (x!,..., x") € R’), show 


that the Legendre transform of a convex function is a convex function. 
d) Show that 


m 


df* => abit 2 Bae -df= y+ dé, 


and deduce from this relation that the Legendre transform is involutive, that is, verify 
the equality 


(Py Gai: 


e) Taking account of d), write the transform (8.106) in the following form, which 
is symmetric in the variables: 
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PEactO Te wae) = Ge 
i= (8.107) 


OF aga x = = 


&= axl 


or, more briefly, in the form 


FYE) + f@) = Ex, §=Vf (x), x=Vf"(), 


where 


0 ) or* af* 
vie = (Tae. a). vyre@= (Tt Je. 


Se = es 
i=1 


f) The matrix formed from the second-order partial derivatives of a function 
(and sometimes the determinant of this matrix) is called the Hessian of the function 
at a given point. 


Let dj; and dij * be the co-factors of the elements ot and af ca 7 of the Hessians 
a? f a2 f a2 f* hes a? f* 
axtax! °° a@xtaxm 951981 0&10Em 
; a : (x), ; cee : (&) 
af af 2 his a f* 
ax™ax! aos ox™ox™ d&m0E1 ae 0&m0Em 


of the functions f(x) and f*(&), and let d and d* be the determinants of these 
matrices. Assuming that d 4 0, show that d - d* = 1 and that 


axtaxt ae aga, 


é) = M4@) 
aero) 


g) A soap film spanning a wire frame forms a so-called minimal surface, having 
minimal area among all the surfaces spanning the contour. 

If that surface is locally defined as the graph of a function z = f(x, y), it turns 
out that the function f must satisfy the following equation for minimal surfaces: 


(1+ A) A 2A AG + (LF FOG, =0. 


Show that after a Legendre transform is performed this equation is brought into 
the form 


(L+0°) fin” + 2&nfén" + (1+ &) fee” =0 
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3. Canonical variables and the Hamilton equations.’ 


a) In the calculus of variations and the fundamental principles of classical me- 
chanics the following system of equations, due to Euler and Lagrange, plays an 


important role: 
oL ddoL 
Tae Sa (t, x, v) = 0, 
ox dt dv 


v=X(t), 


(8.108) 


where L(t, x, v) is a given function of the variables t, x, v, of which ¢ is usually 
time, x the coordinate, and v the velocity. 

The system (8.108) consists of two relations in three variables. Usually we wish 
to determine x = x(t) and v = v(t) from (8.108), which essentially reduces to de- 
termining the relation x = x(t), since v = ee 

Write the first equation of (8.108) in more detail, expanding the derivative a 
taking account of the equalities x = x(t) and v = v(t). 

b) Show that if we change from the coordinates t,x, v, L to the so-called canon- 
ical coordinates t,x, p, H by performing the Legendre transform (see Problem 2) 


_ OL 
oe Th 
H=pv—-L 


with respect to the variables v and L to replace them with p and H, then the Euler— 
Lagrange system (8.108) assumes the symmetric form 


oH __ oH 


= —, 8.109 
Ox “ ap ( ) 


in which it is called system of Hamilton equations. 
c) In the multidimensional case, when L = L(t, x!,...,x,v!,...,v™) the 
Euler—Lagrange system has the form 


ox! = dt dv! (8.110) 
v=x'(t) G@=1,...,m), 
where for brevity we have set x = (xt, x), v=(vl,..., vl”). 
By performing a Legendre transform with respect to the variables v!,..., v’", L, 
change from the variables f, x oo. x’, ul, ...,u™, L to the canonical variables 


8W.R. Hamilton (1805-1865) — famous Irish mathematician and specialist in mechanics. He stated 
a variational principle (Hamilton’s principle), constructed a phenomenological theory of optic phe- 
nomena, and was the creator of quaternions and the founder of vector analysis (in fact, the term 
“vector” is due to him). 
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t,x o...,x™, P1,-+-, Pm, H and show that in these variables the system (8.110) 
becomes the following system of Hamilton equations: 


dH ., 0H 


L 


ee ODi 
L 


Pi= (i =1,...,m). (8.111) 
4. The implicit function theorem. 

The solution of this problem gives another proof of the fundamental theorem of this 
section, perhaps less intuitive and constructive than the one given above, but shorter. 


a) Suppose the hypotheses of the implicit function theorem are satisfied, and let 


OF! OF! 
Fy(x,y)= (rie a ey) 


be the ith row of the matrix F} (x, y). 

Show that the determinant of the matrix formed from the vectors F (x;, yj) is 
nonzero if all the points (x;, yj)  =1,..., 7) lie in some sufficiently small neigh- 
borhood U = I!” x I? of (xo, yo). 

b) Show that, if for x € /!” there are points y;, y2 € / such that F(x, y1) =0 
and F(x, yo) = 0, then for each i € {1,...,m} there is a point (x, y;) lying on the 
closed interval with endpoints (x, y;) and (x, y2) such that 


Fi@,y)O2-y)=0 G@=1,...,n). 


Show that this implies that yj = yo, that is, if the implicit function f : 1” > I? 
exists, it is unique. 

c) Show that if the open ball B(yo; r) is contained in 1’, then F'(xo, y) 4 0 for 
lly — yollex =r > 0. 

d) The function || F (xo, y) lljen is continuous and has a positive minimum value 
u on the sphere || y — yo|lrx =r. 

e) There exists 6 > 0 such that for ||x — xg||ja~ <6 we have 


1 
| Fy = 5H if lly — voller =r 


| Fe.» 


> .% 
Rn < ah if y = yo- 


f) For any fixed x such that ||x — xo|| <6 the function || F(x, y)Ilon attains a 
minimum at some interior point y = f (x) of the open ball || y — yo||j» <7, and since 
the matrix F ; (x, f (x)) is invertible, it follows that F(x, f(x)) = 0. This establishes 
the existence of the implicit function f : B(xo; 6) > B(yo; 7). 


g) If Ay= f(x + Ax) — f(x), then 


ay =F) [FJan, 
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where Fy is the matrix whose rows are the vectors oe (xj, yj) G@=1,...,7), i, i) 
being a point on the closed interval with endpoints (x, y) and (x + Ax, y + Ay). 
The symbol F ’ has a similar meaning. 

Show that this relation implies that the function y = f(x) is continuous. 

h) Show that 


v~ -1 
f@=-[B(x.f@)] - [Fi £@))]. 
5. “If f(, y, z) =0, then * . oy . oe =-1.” 
a) Give a precise meaning to this statement. 


b) Verify that it holds in the example of Clapeyron’s ideal gas equation 


P-.V 
——— = const 
T 


and in the general case of a function of three variables. 
c) Write the analogous statement for the relation f(x!,...,x’”’) =0 among m 
variables. Verify that it is correct. 


6. Show that the roots of the equation 
zh eyz" | $e ten =0 


are smooth functions of the coefficients, at least when they are all distinct. 


8.6 Some Corollaries of the Implicit Function Theorem 
8.6.1 The Inverse Function Theorem 


Definition 1 A mapping f : U — V, where U and V are open subsets of R’”, is a 
C'?) diffeomorphism or a diffeomorphism of smoothness p (p = 0, 1, ...), if 


1) fecU;V); 
2) f is a bijection; 
3) fole CPV; U). 


A C)-diffeomorphism is called a homeomorphism. 


As a rule, in this book we shall consider only the smooth case, that is, the case 
pENor p=. 

The basic idea of the following frequently used theorem is that if the differential 
of a mapping is invertible at a point, then the mapping itself is invertible in some 
neighborhood of the point. 
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Theorem 1 (Inverse function theorem) [fa mapping f : G > R” of a domain G C 
R” is such that 


1° fec”(G;R”), p>, 
29 yo = f (xo) at x0 €G, 
3° f'(x0) is invertible, 


then there exists a neighborhood U(xo) C G of xo and a neighborhood V (yo) of 
yo such that f : U(xo) > V(yo) is a C?)-diffeomorphism. Moreover, if x € U (xo) 
and y = f (x) € V(y0), then 


(FY) =(F'@) 
Proof We rewrite the relation y = f(x) in the form 
F(x,y)=f)-y=0. (8.112) 


The function F(x, y) = f(x) — y is defined for x € G and y € R”, that is it is 
defined in the neighborhood G x R” of the point (xo, yo) € R” x R”. 

We wish to solve Eq. (8.112) with respect to x in some neighborhood of (x0, yo). 
By hypotheses 1°, 2°, 3° of the theorem the mapping F(x, y) has the property that 


Fec(GxR";R"), p21, 
F (xo, yo) =9, 


F’(xo, yo) = f’(xo) _ is invertible. 


By the implicit function theorem there exist a neighborhood J, x Iy of (xo, yo) 
and a mapping g € CH: I) such that 


fa)-y=0ex=a(y) (8.113) 
for any point (x, y) € Jy x Jy and 
sy) =-[F@»] [FG y)]- 
In the present case 
F.@.y=f'@), F@.y)=-E, 
where E is the identity matrix; therefore 
go) =(f'@) (8.114) 


If we set V = Jy and U = g(V), relation (8.113) shows that the mappings f : 
U — V and g: V > U are mutually inverse, that is, g = f~! on V. 
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Since V = Jy, it follows that V is a neighborhood of yo. This means that under 
hypotheses 1°, 2°, and 3° the image yo = f (xo) of x9 € G, which is an interior point 
of G, is an interior point of the image f(G) of G. By formula (8.114) the matrix 
g'(yo) is invertible. Therefore the mapping g: V — U has properties 1°, 2°, and 
3° relative to the domain V and the point yg € V. Hence by what has already been 
proved xo = g(yo) is an interior point of U = g(V). 

Since by (8.114) hypotheses 1°, 2°, and 3° obviously hold at any point y € V, 
any point x = g(y) is an interior point of U. Thus U is an open (and obviously even 
connected) neighborhood of x9 € R”. 

We have now verified that the mapping f : U — V satisfies all the conditions of 
Definition | and the assertion of Theorem 1. 


We shall now give several examples that illustrate Theorem 1. 

The inverse function theorem is very often used in converting from one coordi- 
nate system to another. The simplest version of such a change of coordinates was 
studied in analytic geometry and linear algebra and has the form 


yn at San am xm 
or, in compact notation, ys = a; x!. This linear transformation A : RY — RY has 
an inverse A~! : IR — RY defined on the entire space R‘? if and only if the matrix 
(a} ) is invertible, that is, det(a/ ) 40. 

The inverse function theorem is a local version of this proposition, based on the 
fact that in a neighborhood of a point a smooth mapping behaves approximately like 
its differential at the point. 


Example I (Polar coordinates) The mapping f : Re —> R? of the half-plane R?. = 
{(p, ~) € R* | p = 0} onto the plane R? defined by the formula 
xX = Pcosg, 
(8.115) 
y=psing, 


is illustrated in Fig. 8.5. 
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Fig. 8.6 


The Jacobian of this mapping, as can be easily computed, is p, that is, it is 
nonzero in a neighborhood of any point (p, ¢~), where p > 0. Therefore formulas 
(8.115) are locally invertible and hence locally the numbers p and ¢ can be taken as 
new coordinates of the point previously determined by the Cartesian coordinates x 
and y. 

The coordinates (p, g) are a well known system of curvilinear coordinates on the 
plane — polar coordinates. Their geometric interpretation is shown in Fig. 8.5. We 
note that by the periodicity of the functions cos g and sing the mapping (8.115) is 
only locally a diffeomorphism when p > 0; it is not bijective on the entire plane. 
That is the reason that the change from Cartesian to polar coordinates always in- 
volves a choice of a branch of the argument ¢ (that is, an indication of its range of 
variation). 

Polar coordinates (p, yv, y) in three-dimensional space R° are called spherical 
coordinates. They are connected with Cartesian coordinates by the formulas 


z=pcosy, 
y=psiny sing, (8.116) 


x=psinwcosg. 


The geometric meaning of the parameters p, yy, and @ is shown in Fig. 8.6. 

The Jacobian of the mapping (8.116) is p* sin, and so by Theorem | the map- 
ping is invertible in a neighborhood of each point (p, w, g) at which p > 0 and 
siny #0. 

The sets where o = const, g = const, or w = const in (x, y, z)-space obviously 
correspond to a spherical surface (a sphere of radius (), a half-plane passing through 
the z-axis, and the surface of a cone whose axis is the z-axis respectively. 

Thus in passing from coordinates (x, y, z) to coordinates (p, ¥, g), for example, 
the spherical surface and the conical surface are flattened; they correspond to pieces 
of the planes = const and w = const respectively. We observed a similar phe- 
nomenon in the two-dimensional case, where an arc of a circle in the (x, y)-plane 
corresponded to a closed interval on the line in the plane with coordinates (p, g) 
(see Fig. 8.5). Please note that this is a local straightening. 
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In the m-dimensional case polar coordinates are introduced by the relations 
1_ 
xX" = pcos, 


x? = psing) cos¢2, 


(8.117) 
qn /P SiN G1 SiN P2 + + - SIN Py —2 COS Pm —1, 
x” = posing] Sing? --+SiN@p—2 SINPm_1. 
The Jacobian of this transformation is 
m—1 o:.m—2 +s m—3 ‘ 
p sin gy, sin $2 -++SIN@n—2, (8.118) 


and by Theorem | it is also locally invertible everywhere where this Jacobian is 
nonzero. 


Example 2 (The general idea of local rectification of curves) New coordinates are 
usually introduced for the purpose of simplifying the analytic expression for the 
objects that occur in a problem and making them easier to visualize in the new 
notation. 

Suppose for example, a curve in the plane R? is defined by the equation 


F(x, y)=0. 


Assume that F is a smooth function, that the point (xo, yo) lies on the curve, that is, 
F (xo, yo) = 0, and that this point is not a critical point of F. For example, suppose 
Fy(x, y) #0. 
Let us try to choose coordinates €, 7 so that in these coordinates a closed interval 
of a coordinate line, for example, the line 7 = 0, corresponds to an arc of this curve. 
We set 


&é=x-—x0, n=F(x,y). 


1 O 
(rm )or 
/ 


of this transformation has as its determinant the number F;,(x, y), which by assump- 
tion is nonzero at (xo, yo). Then by Theorem 1, this mapping is a diffeomorphism of 
a neighborhood of (xo, yo) onto a neighborhood of the point (&, 7) = (0, 0). Hence, 
inside this neighborhood, the numbers & and 7 can be taken as new coordinates of 
points lying in a neighborhood of (xo, yo). In the new coordinates, the curve ob- 
viously has the equation 7 = 0, and in this sense we have indeed achieved a local 
rectification of it (see Fig. 8.7). 


The Jacobi matrix 
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Fig. 8.7 y F(z,y) =0 
1 
0 x 
Fig. 8.8 
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& cepeu 


8.6.2 Local Reduction of a Smooth Mapping to Canonical Form 


In this subsection we shall consider only one question of this type. To be specific, we 
shall exhibit a canonical form to which one can locally reduce any smooth mapping 
of constant rank by means of a suitable choice of coordinates. 

We recall that the rank of a mapping f : U > R” of adomain U C R” at a point 
x €U is the rank of the linear transformation tangent to it at the point, that is, the 
rank of the matrix f’(x). The rank of a mapping at a point is usually denoted rank 


Ff (x). 


Theorem 2 (The rank theorem) Let f : U > R” be a mapping defined in a neigh- 
borhood U C R" of a point xo € R”. If f ¢ C”)(U; R"), p = 1, and the mapping 
f has the same rank k at every point x € U, then there exist neighborhoods O(x0) 
of x9 and O(yo) of yo = f (xo) and diffeomorphisms u = 9(x), v = W(y) of those 
neighborhoods, of class C'?), such that the mapping v = Wo f o g~'(u) has the 
coordinate representation 


Hie ne VSR IS ei SE Sa O60) 119) 
in the neighborhood O(ug) = ¢(O(x0)) of Uo = G(X0). 


In other words, the theorem asserts (see Fig. 8.8) that one can choose coordinates 
(ul,...,u™) in place of (x!,..., x) and (v!,..., v") in place of (y!,..., y”) in 
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such a way that locally the mapping has the form (8.119) in the new coordinates, 
that is, the canonical form for a linear transformation of rank k. 


Proof We write the coordinate representation 


(8.120) 


y"” . Pe peek”) 


of the mapping f : U — RY‘, which is defined in a neighborhood of the point 
xo € R”. In order to avoid relabeling the coordinates and the neighborhood U, we 
shall assume that at every point x € U, the principal minor of order k in the upper 
left corner of the matrix f’(x) is nonzero. 


Let us consider the mapping defined in a neighborhood U of xo by the equalities 


u! SO tage (Spe eg 


wk = gel, gx” = fk Maviooa se 
ykti -_ gat ; x) = xhtl (8.121) 
u™ =o" (x', ea, 


The Jacobi matrix of this mapping has the form 


af! ae af! 
ax! ~ axk 7 axk+l a ox™ 
aft afk : _afk aft 
ax! vie axk : axktl Ves ax™m 
bi 
1 0 
0 
0 1 


and by assumption its determinant is nonzero in U. 
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By the inverse function theorem, the mapping u = g(x) is a diffeomorphism 
of smoothness p of some neighborhood O(xo) C U of xo onto a neighborhood 
O(uo) = g(O(x0)) of uo = Y(X0). 

Comparing relations (8.120) and (8.121), we see that the composite function g = 
foe: O(ug) > IR* has the coordinate representation 


a — ¥ ao at ...0gH™) =u!, 


(8.122) 


ya fF” BO osngl) SE" (Use agtl™)s 


Since the mapping y~! : O(ug) > O(xo) has maximal rank m at each point u € 
O(uo), and the mapping f : O(x0) > R} has rank k at every point x € O (x), it fol- 
lows, as is known from linear algebra, that the matrix g/(u) = f’(g~!(u))(g~!) (w) 
has rank k at every point u € O(uo). 

Direct computation of the Jacobi matrix of the mapping (8.122) yields 


1 0 
0 
0 1 
agkt! agktl : agktl agkt! 
aul ae auk *  QuktT dul 
ag" dg” . dg” ag" 
aul ae auk 7 QukeT aul” 
‘ ~ ; j . 
Hence at each point u € O(uo) we obtain 3° (u) =0 fori =k+1,...,m; 


j=k+1,...,n. Assuming that the neighborhood O(uo) is convex (which can 
be achieved by shrinking O(uo) to a ball with center at uo, for example), we can 
conclude from this that the functions g/, j =k+1,...,n, really are independent of 


the variables u*t!, 20.) wu”. 
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After this decisive observation, we can rewrite the mapping (8.122) as 


Se 
‘ie (8.123) 
yetl = ghtl(yl uk), 


y= Cas eit"), 
At this point we can exhibit the mapping w. We set 


vay =p iy), 


= 7"), 


eee es (8.124) 
perl = yk — gktI(yl yk) =: yhtl(yy, : 
vt = y" — g(y!,..., yF) = w"(y). 
It is clear from the construction of the functions g/ (j =k+1,...,n) that the 
mapping y is defined in a neighborhood of yo and belongs to class C“?) in that 


neighborhood. 
The Jacobi matrix of the mapping (8.124) has the form 


1 0 
0 
0 1 
agkt! agkt! 
~~ ay! yk ! 0 
ag” dg” 
ay — ayk a : 


Its determinant equals 1, and so by Theorem | the mapping y is a diffeomor- 
phism of smoothness p of some neighborhood O(yo) of yo € R', onto a neighbor- 
hood O(v9) = W(O(yo)) of vo € R". 

_ Comparing relations (8.123) and (8.124), we see that in a neighborhood O(uo) C 
O(uo) of uo so small that g(O(uo)) C O(y), the mapping wo f og7!: O(u9) > 
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R}, is a mapping of smoothness p from this neighborhood onto some neighborhood 
O(v9) C O(v9) of vo € R% and that it has the canonical form 


v=u, 
uk = wk 
; (8.125) 
pktl = 0, 
vy" - 0 


Setting g~!(O(uo)) = O(xo) and w~!(O(v9)) = O(y0), we obtain the neigh- 
borhoods of xo and yo whose existence is asserted in the theorem. The proof is now 
complete. 


Theorem 2, like Theorem |, is obviously a local version of the corresponding 
theorem from linear algebra. 

In connection with the proof just given of Theorem 2, we make the following 
remarks, which will be useful in what follows. 


Remark I If the rank of the mapping f : U > R” is n at every point of the original 
neighborhood U C R”, then the point yo = f (xo), where xo € U, is an interior point 
of f(U), that is, f(U) contains a neighborhood of this point. 


Proof Indeed, from what was just proved, the mapping wo f og! : O(uo) > 
O(vo) has the form 


2s gl”) =a P= (sas VS Oe ciscgl”) 
in this case, and so the image of a neighborhood of up = y(xg) contains some neigh- 
borhood of vp = Wo f og! (ug). 

But the mappings g : O(xo) > O(uo) and Ww: O(yo) > Ovo) are diffeomor- 
phisms, and therefore they map interior points to interior points. Writing the original 
mapping f as f =w~!o(wWo fog!) og, we conclude that yo = f (x) is an in- 
terior point of the image of a neighborhood of xo. 


Remark 2 If the rank of the mapping f : U > R” is k at every point of a neighbor- 
hood U and k <n, then, by Eqs. (8.120), (8.124), and (8.125), in some neighbor- 
hood of x9 € U C R” the following n — k relations hold; 


re: Sega | SO aa ee: Ce ae) (Gi =k-+1,...,n). 
(8.126) 
These relations are written under the assumption we have made that the principal 
minor of order k of the matrix f’ (xq) is nonzero, that is, the rank k is realized on the 
set of functions f!,..., f*. Otherwise one may relabel the functions f!,..., f” 
and again have this situation. 
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8.6.3 Functional Dependence 


Definition 2 A system of continuous functions f!(x) = fi(x!,...,x”) G = 1, 
...,M) is functionally independent in a neighborhood of a point xo = Gi; eee 
if for any continuous function F(y) = F(y!,..., y”) defined in a neighborhood of 


yo = Gee 1085 Ve) = (f!(x0), ..., f"(%9)) = f (xo), the relation 


is possible at all points of a neighborhood of xo only when F(y!,..., y?)=Oina 
neighborhood of yo. 


The linear independence studied in algebra is independence with respect to linear 
relations 


F(y',...,y") =Ary) tees + Any”. 


If a system is not functionally independent, it is said to be functionally dependent. 

When vectors are linearly dependent, one of them obviously is a linear combina- 
tion of the others. A similar situation holds in the relation of functional dependence 
of a system of smooth functions. 


Proposition 1 /f a system fi(x!,...,x™) (i =1,...,n) of smooth functions de- 
fined on a neighborhood U (xo) of the point x9 € R” is such that the rank of the 
matrix 


af! af! 

: : |) 
ay” af” 
fe 


is equal to the same number k at every point x € U, then 


a) when k =n, the system is functionally independent in a neighborhood of xo; 
b) when k <n, there exist a neighborhood of xo and k functions of the system, 
say f!,..., f* such that the other n — k functions can be represented as 


in this neighborhood, where g'(y',..., y*), (i =k+1,...,n) are smooth functions 
defined in a neighborhood of yo = (f'(xo),..., f"(x0)) and depending only on k 
coordinates of the variable point y = (y!,..., y"). 


Proof In fact, if k =n, then by Remark | after the rank theorem, the image of a 
neighborhood of the point xo under the mapping 
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y! = fie, ae ae), 
(8.127) 

y= fx, FO is) 


contains a neighborhood of yo = f(x). But then the relation 


POP ie” tesa ee eae SO 


can hold in a neighborhood of xo only if 
F(y'!,...,y") =0 


in a neighborhood of yo. This proves assertion a). 

If k <n and the rank k of the mapping (8.127) is realized on the functions 
f',..., f*, then by Remark 2 after the rank theorem, there exists a neighborhood of 
yo = f (xo) and n — k functions g!(y) = g/(y!,...,y G@ =k+1,...,n), defined 
on that neighborhood, having the same order of smoothness as the functions of the 
original system, and such that relations (8.126) hold in some neighborhood of xo. 
This proves b). 


We have now shown that if k <n there exist n — k special functions F i (y) = 
yi — g' Ge, suns yk) (i =k+1,...,n) that establish the relations 


Fi(f'(x),..., fF @), f'@))=0 G@=k+1,...,n) 


between the functions of the system f!,..., f*,..., f” in a neighborhood of the 
point xo. 


8.6.4 Local Resolution of a Diffeomorphism into a Composition 
of Elementary Ones 


In this subsection we shall show how, using the inverse function theorem, one can 
represent a diffeomorphic mapping locally as a composition of diffeomorphisms, 
each of which changes only one coordinate. 


Definition 3 A diffeomorphism g : U > R” of an open set U C R” will be called 
elementary if its coordinate representation is 

y=x', ie{l,....m}, i4j, 

yl = gi (x!, sey a 


that is, under the diffeomorphism g : U — R” only one coordinate of the point 
being mapped is changed. 
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Proposition 2 [f f : G — R” is a diffeomorphism of an open set G C R", then for 
any point xo € G there is a neighborhood of the point in which the representation 
f =810°--+0 gy holds, where g\,..., 8n are elementary diffeomorphisms. 


Proof We shall verify this by induction. 
If the original mapping f is itself elementary, the proposition holds trivially for it. 
Assume that the proposition holds for diffeomorphisms that alter at most (k — 1) 
coordinates, where k — 1 <n. Now consider a diffeomorphism f : G — R” that 
alters k coordinates: 


(8.128) 


We have assumed that it is the first kK coordinates that are changed, which can 
be achieved by linear changes of variable. Hence this assumption causes no loss in 
generality. 

Since f is a diffeomorphism, its Jacobi matrix f’(x) is nondegenerate at each 
point, for 


FY (¢@)=[F@]". 


Let us fix x9 € G and compute the determinant of f’(xo): 


af! af! : af! af! 
Sed A eee ee SY 
af! aft 
af* af* : af* af k ax! axk 
Axt Bx OxT TT B™ x=]. | 0) #0. 
ax! axk 
1 0 
0 
0 1 


Thus one of the minors of order k — 1 of this last determinant must be nonzero. 
Again, for simplicity of notation, we shall assume that the principal minor of order 
k — 1 is nonzero. Now consider the auxiliary mapping g : G > R” defined by the 
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equalities 


k-1 k-1(,1 m 
u a i 5 aera ia | 
( ) (8.129) 
uk = xk, 
u”™ =x", 
Since the Jacobian 
af! af! af! af! 
ax! axk-l axk ax™ 
Ce 
i eo) 2 oe aft! ax! o 
coe aT OR OR yy =f ot |x) £0 
: ., : : : . : afk! afkel 
ax! axk-l 
1 0 
0 
0 1 


of the mapping g : G > R” is nonzero at xo € G, the mapping g is a diffeomor- 
phism in some neighborhood of xo. 

Then, in some neighborhood of uo = g(xo) the mapping inverse to g, x = 
gl (u), is defined, making it possible to introduce new coordinates (u!,..., ul”) 
in a neighborhood of xo. 

Let h = f o g~!. In other words, the mapping y = h(w) is the mapping (8.128) 
y = f(x) written in u-coordinates. The mapping h, being the composition of dif- 
feomorphisms, is a diffeomorphism of some neighborhood of ug. Its coordinate 
expression obviously has the form 


that is, h is an elementary diffeomorphism. 
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But f =hog, and by the induction hypothesis the mapping g defined by (8.129) 
can be resolved into a composition of elementary diffeomorphisms. Thus, the dif- 
feomorphism f, which alters k coordinates, can also be resolved into a composition 
of elementary diffeomorphisms in a neighborhood of x9, which completes the in- 
duction. 


8.6.5 Morse’s Lemma 


This same circle of ideas contains an intrinsically beautiful lemma of Morse” on the 
local reduction of smooth real-valued functions to canonical form in a neighborhood 
of a nondegenerate critical point. This lemma is also important in applications. 


Definition 4 Let xo be a critical point of the function f €¢ C®(U; R) defined ina 
neighborhood U of this point. 
The critical point xo is a pues Chas point of f if the Hessian of the 


function at that point (that is, the matrix a7 5 a Fi 
partial derivatives) has a nonzero detemminact 

If xo is a critical point of the function, that is, f’(xo) = 0, then by Taylor’s for- 
mula 


——— (xo) formed from the second-order 


2 


Ox'dx/ 


1 . ; 
f(x) — fo) = = 25 (xo) (x! — x4) (x — xd) + o(|lx — x0ll?). (8.130) 


Morse’s lemma asserts that one can make a local change of coordinates x = g(y) 
such that the function will have the form 


(f og)(y) — f(x) =—(y!)? — + = (y*)? + (yet? + + (p"? 


when expressed in y-coordinates. 

If the remainder term o0(||x — xo||”) were not present on the right-hand side of 
Eq. (8.130), that is, the difference f(x) — f (xo) were a simple quadratic form, then, 
a is known from algebra, it could be brought into the indicated canonical form by 
a linear transformation. Thus the assertion we are about to prove is a local version 
of the theorem on reduction of a quadratic form to canonical form. The proof will 
use the idea of the proof of this algebraic theorem. We shall also rely on the inverse 
function theorem and the following proposition. 


Hadamard’s lemma Let f : U > R be a function of class C’)(U;R), p = 1, 
defined in a convex neighborhood U of the point 0 = (0,...,0) € R” and such that 


°H.C.M. Morse (1892-1977) — American mathematician; his main work was devoted to the appli- 
cation of topological methods in various areas of analysis. 
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f (0) =0. Then there exist functions g; € C'’?~) (U; R) i =1,...,m) such that the 
equality 


m 
PE cite = FE Be i”) (8.131) 
j=1 


holds in U, and g;(0) = a (0). 


Proof Equality (8.131) is essentially another useful expression for Taylor’s formula 
with the integral form of the remainder term. It follows from the equalities 


f(x: tee a= [ =o 1 tX™) =D fz a at seg) dt, 


if we set 


He awk = =f ae newesee Vide (i=1,...,m). 


The fact that g;(0) = a F (0) (i =1,...,m) is obvious, and it is also not difficult 
to verify that g; € C @-D (YU; R). eevee we shall not undertake the verification 
just now, since we shall later give a general rule for differentiating an integral de- 
pending on a parameter, from which the property we need for the functions g; will 
follow immediately. 

Thus, up to this verification, Hadamard’s formula (8.131) is proved. 


Morse’s lemma /f f : G — R is a function of class C°)(G; R) defined on an open 
set G C R” and xo € G is a nondegenerate critical point of that function, then there 
exists a diffeomorphism g:V — U of some neighborhood of the origin 0 in R” 
onto a neighborhood U of xg such that 


(f og)(y) = f (x0) — [(y')? neh (y*)7] + [(yk)? qededee (y")"] 
forallyeV. 


Proof By linear changes of variable we can reduce the problem to the case when 

xo = 0 and f (xo) = 0, and from now on we shall assume that these conditions hold. 

Since x9 = 0 is acritical point of f, we have g; (0) = 0 in formula (8.131) (i = 1, 
.,m). Then, also by Hadamard’s lemma, 


m 
BAe seatgk™) Se RC aia); 
j=l 
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where /;; are smooth functions in a neighborhood of 0 and consequently 


m 
i a ye a lig (cia). (8.132) 


i,j=l 


By making the substitution h; j= 5(h ij + ;;) if necessary, we can assume that 
hj; = hj;. We remark also that, by the uniqueness of = Taylor expansion, the con- 


tinuity of the functions h;; implies that h;;(0) = 
(hj; (0)) is nondegenerate. 

The function f has now been written in a manner that resembles a quadratic 
form, and we wish, so to speak, to reduce it to diagonal form. 

As in the classical case, we proceed by induction. 

Assume that there exist coordinates u!,...,u’" ina neighborhood U, of 0 € R”, 
that is, a diffeomorphism x = g(u), such that 


wt 7 (0) and hence the matrix 


(fogu)=+(ul)? +--+ (wi) + So wal Hi(u',...,w) (8.133) 


i,j=r 


in the coordinates u!,..., u’", where r > 1 and Hj; = Hi. 

We observe that relation (8.133) holds for r = 1, as one can see from (8.132), 
where Hj; = hij. 

By the hypothesis of the lemma the quadratic form )7j";_| x ‘x/hjj (0) is non- 
degenerate, that is, det(h;;(0)) 4 0. The change of variable x = (uw) is carried out 
by a ee so that det gy’ (0) 0. But then the matrix of the quadratic 
form +(u!)?+---+(u"—!)2 2a Ae u'u! H;; (0) obtained from the matrix (h;; (0)) 
through right-multiplication by the mii g’ (0) and left-multiplication by the trans- 
pose of y’(0) is also nondegenerate. Consequently, at least one of the numbers 
Hj; (0) G, j =r, ...,m) is nonzero. By a linear change of variable we can bring the 
form >>” pee ‘u/ H;;(O) to diagonal form, and so we may assume that H,,.(0) 4 0 
in Eq. (8. 133). By the continuity of the functions Hj; (u) the inequality H,,;(u) 4 0 
will also hold in some neighborhood of u = 0. 

Let us set y(u!,...,u’”) = /]H,,(u)|. Then the function y belongs to the class 
C (Uz; R) in some neighborhood U2 Cc U, of u = 0. We now change to coordi- 
nates (vi, ...,v") by the formulas 


vi=u', ixr, 


‘Hir(ut yu 8.134 
=o ul wy (w+ SEE guia ng ll ’). ( ) 


H,, (ul, ...,H”) 


i>r 


The Jacobian of the transformation (8.134) at u = 0 is obviously equal to (0), 
that is, it is nonzero. Then by the inverse function theorem we can assert that in 
some neighborhood U3 C U2 of u = 0 the mapping v = w(u) defined by (8.134) 
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is a diffeomorphism of class C“!) (U3; R’”) and therefore the variables (v!,..., v’”) 
can indeed serve as coordinates of points in U3. 
We now separate off in Eq. (8.133) all terms 


m 
i edit oa ul”) +2 a uu! H,j(u', ae ‘uy, (8.135) 
j=rtl 
containing u’. In the expression (8.135) for the sum of these terms we have used the 
fact that Hii; = Ajj. 
Comparing (8.134) and (8.135), we see that we can rewrite (8.135) in the form 


The ambiguous sign + appears in front of v’v" because H,,, = +(y)’, the posi- 
tive sign being taken if H,, > 0 and the negative sign if H,, <0. 
Thus, after the substitution v = w(u), the expression (8.133) becomes the equal- 
ity 
(f ogoy ')(v) = YS “[4(v') | + > vv! Hi (v', ena a) 


i=l i,j>r 


where H; ; are new smooth functions that are symmetric with respect to the indices 
i and j. The mapping y o y~! is a diffeomorphism. Thus the induction from r — 1 
to r is now complete, and Morse’s lemma is proved. 


8.6.6 Problems and Exercises 


1. Compute the Jacobian of the change of variable (8.118) from polar coordinates 
to Cartesian coordinates in R”. 
2. a) Let xo be a noncritical point of a smooth function F : U > R defined in 
a neighborhood U of x9 = Ga: ...,X9') € R”. Show that in some neighborhood 
U CU of xo one can introduce curvilinear coordinates (é!,..., £””) such that the 
set of points defined by the condition F(x) = F (xo) will be given by the equation 
&” = 0 in these new coordinates. 

b) Let g,we Cc“ (D;R), and suppose that (g(x) = 0) > (W(x) = 0) in the 
domain D. Show that if grad g ¥ 0, then there is a decomposition yy = 6 - g in D, 
where 0 € C&—)(D; R). 


3. Let f : R* — R? be a smooth mapping satisfying the Cauchy—Riemann equa- 
tions 
df ar af! af? 


ax! ax2’ ax2 axl” 
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a) Show that the Jacobian of such a mapping is zero at a point if and only if 
f' (x) is the zero matrix at that point. 

b) Show that if f’(x) 4 0, then the inverse f—! to the mapping f is defined in 
a neighborhood of f and also satisfies the Cauchy—Riemann equations. 


4. Functional dependence (direct proof). 


a) Show that the functions 2'(x) =x! (i =1,...,m), regarded as functions of 
the point x = (x!,...,x”") € R”, form an independent system of functions in a 
neighborhood of any point of R”. 

b) Show that, for any function f € C(R”; R) the system z!,..., 2", f is func- 
tionally dependent. 

c) If the system of smooth functions f!,..., f*, k < m, is such that the rank 
of the mapping f = (f!,..., f*) equals k at a point x9 = (has ..+,%9) € R”, then 
in some neighborhood of this point one can complete it to an independent system 
f',..., f™ consisting of m smooth functions. 

d) If the system 


ea ik Comer a (i =1,...,m) 


of smooth functions is such that the mapping f = (f!,..., f’”) has rank m at the 
point x9 = Gs ...,X9'), then the variables (& ere &") can be used as curvilinear 
coordinates in some neighborhood U (xo) of x9, and any function g : U(xp) > R 
can be written as p(x) = F(f!(x),..., f’(x)), where F =o f7!. 

e) The rank of the mapping provided by a system of smooth functions is also 
called the rank of the system. Show that if the rank of a system of smooth functions 
Faseae ..., 4) G=1,...,k) is k and the rank of the system fs ..., f”, g is also 
k at some point x9 € R”, then g(x) = F(f!(x),..., f°) in a neighborhood of 
the point. 

Hint: Use c) and d) and show that 


BP af SES sce f 


5. Show that the rank of a smooth mapping f : R” — R” is a lower semicontinuous 
function, that is rank f(x) > rank f (xo) in a neighborhood of a point x9 € R”. 
6. a) Give a direct proof of Morse’s lemma for functions f :R —> R. 

b) Determine whether Morse’s lemma is applicable at the origin to the following 
functions: 


fae GOSas POSE ae 
X Xx 


f(x, y) =x? — 3xy’; f(x, y) =x. 


c) Show that nondegenerate critical points of a function f ¢ C®)(R”; R) are 
isolated: each of them has a neighborhood in which it is the only critical point of f. 
d) Show that the number k of negative squares in the canonical representation 
of a function in the neighborhood of a nondegenerate critical point is independent 
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of the reduction method, that is, independent of the coordinate system in which the 
function has canonical form. This number is called the index of the critical point. 


8.7 Surfaces in R” and the Theory of Extrema with Constraint 


To acquire an informal understanding of the theory of extrema with constraint, 
which is important in applications, it is useful to have some elementary informa- 
tion on surfaces (manifolds) in R”. 


8.7.1 k-Dimensional Surfaces in IR" 


Generalizing the concept of a law of motion of a point mass x = x(t), we have 
previously introduced the concept of a path in R” as a continuous mapping I”: J > 
R” of an interval J Cc R. The degree of smoothness of the path was defined a the 
degree of smoothness of this mapping. The support "(/) C R” of a path can be a 
rather peculiar set in R”, which it would be a great stretch to call a curve in some 
instances. For example, the support of a path might be a single point. 

Similarly, a continuous or smooth mapping f : /* — R” of a k-dimensional in- 
terval I C R*, called a singular k-cell in R", may have as its image f(J/*) not at 
all what one would like to call a k-dimensional surface in R”. For example, it might 
again be simply a point. 

In order for a smooth mapping f : G > R” of a domain G C R* to define a 
k-dimensional geometric figure in R” whose points are described by k independent 
parameters (t!, Ri tuie t*) € G, it suffices, as we know from the preceding section, to 
require that the rank of the mapping f : G > R” be k at each point ¢ € G (naturally, 
k <n). In that case the mapping f : G > f(G) is locally one-to-one (that is, in a 
neighborhood of each point f € G). 

Indeed, suppose rank f (#9) = k and this rank is realized, for example, on the first 
k of the n functions 


: (8.136) 
x= (tl... 04) 
that define the coordinate expressions for the mapping f : G > R”. 

Then, by the inverse function theorem the variables t!,..., 1“ can be expressed 
in terms of x!,..., x* in some neighborhood U (to) of to. It follows that the set 
Ff (U (to)) can be written as 

xktl Sree ate) ee ie =o" x", wig) 
(that is, it projects in a one-to-one manner onto the coordinate plane of x!,..., x*), 


and therefore the mapping f : U(to) > f(U(to)) is indeed one-to-one. 
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Fig. 8.9 


However, even the simple example of a smooth one-dimensional path (Fig. 8.9) 
makes it clear that the local injectivity of the mapping f : G > R” from the pa- 
rameter domain G into R” is by no means necessarily a global injectivity. The tra- 
jectory may have multiple self-intersections, so that if we wish to define a smooth 
k-dimensional surface in R” and picture it as a set that has the structure of a slightly 
deformed piece of a k-dimensional plane (a k-dimensional subspace of R”) near 
each of its points, it is not enough merely to map a canonical piece G C R ofa 
k-dimensional plane in a regular manner into R”. It is also necessary to be sure that 
it happens to be globally imbedded in this space. 


Definition 1 We shall call a set § C R” a k-dimensional smooth surface in R" (or a 
k-dimensional submanifold of R") if for every point x9 € S there exist a neighbor- 
hood U (xo) in R” and a diffeomorphism gy : U(x9) > I” of this neighborhood onto 
the standard n-dimensional cube J” = {t € R” | |t’| < 1,i=1,...,n} of the space 
RR” under which the image of the set SM U (xo) is the portion of the k-dimensional 
plane in R” defined by the relations r*+! = 0, ..., #” = 0 lying inside J” (Fig. 8.10). 


We shall measure the degree of smoothness of the surface S by the degree of 
smoothness of the diffeomorphism 9. 

If we regard the variables t!,...,t” as new coordinates in a neighborhood of 
U(xo), Definition 1 can be rewritten briefly as follows: the set S C R” is a k- 


U(xo) 
a 
i 
1 
-1 0 1 ¢ 
-1 


Fig. 8.10 
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dimensional surface (k-dimensional submanifold) in R” if for every point x9 € S 
there is a neighborhood U (xq) and coordinates t!,..., tf” in U(xo) such that in these 
coordinates the set SM U(x) is defined by the relations 


The role of the standard n-dimensional cube in Definition | is rather artificial 
and approximately the same as the role of the standard size and shape of a page in 
a geographical atlas. The canonical location of the interval in the coordinate system 
t!,...,f” is also a matter of convention and nothing more, since any cube in R” can 
always be transformed into the standard n-dimensional cube by an additional linear 
diffeomorphism. 

We shall often use this remark when abbreviating the verification that a set S C 
IR” is a surface in R”. 

Let us consider some examples. 


Example 1 The space R” itself is an n-dimensional surface of class C‘). As the 
mapping gy : R” — J” here, one can take, for example, the mapping 


i_ 2 a 
é' =—arctanx’ (i=1,...,n). 
a 


Example 2 The mapping constructed in Example | also establishes that the sub- 
space of the vector space R” defined by the conditions x*+! =...= x" =Oisa 
k-dimensional surface in R” (or a k-dimensional submanifold of R”). 


Example 3 The set in R” defined by the system of relations 
ae fee aa + aj, xktt fee ax =0, 


antl ppt kak 4 ath] 4... + ann” =0, 


provided this system has rank n — k, is a k-dimensional submanifold of R”. 
Indeed, suppose for example that the determinant 


Tey ay 
n—-k n—k 
G41 7° an 
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k+1 


tT = glyl 4... talyx", 


t" = atl te pe gt yn, 


is obviously nondegenerate. In the coordinates t!,..., t” the set is defined by the 
conditions r+! =... = ¢” =0, already considered in Example 2. 


Example 4 The graph of a smooth function x” = f(x!,...,x”~!) defined in a do- 
main G C R”! is a smooth (n — 1)-dimensional surface in R”. 
Indeed, setting 


we obtain a coordinate system in which the graph of the function has the equation 
r” =0. 


Example 5 The circle x* + y? = 1 in R? is a one-dimensional submanifold of R?, as 
is established by the locally invertible conversion to polar coordinates (p, g) studied 
in the preceding section. In these coordinates the circle has equation p = 1. 


Example 6 This example is a generalization of Example 3 and at the same time, as 
can be seen from Definition 1, gives a general form for the coordinate expression of 
submanifolds of R”. 

Let Fi(x!,...,x") G@=1,...,n—k) bea system of smooth functions of rank 
n — k. We shall show that the relations 


PE ce Ge ace) =0, 
: (8.137) 
lames 0 eee oe ane igh”) =0 
define a k-dimensional submanifold S in R”. 
Suppose the condition 
af! F 
axkt+1 ax” 
(xo) #0 (8.138) 
aRpr-k — apr-k 
OxeI ox 


holds at a point x9 € S. Then by the inverse function theorem the transformation 


t=x' (i=1,...,k), 
BaP Ee") Cah ltayn) 


is a diffeomorphism of a neighborhood of this point. 
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In the new coordinates t!,...,t” the original system will have the form t*+! = 
-+» =f” =0; thus, S is a k-dimensional smooth surface in R”. 


Example 7 The set E of points of the plane R? satisfying the equation x* — y* = 0 
consists of two lines that intersect at the origin. This set is not a one-dimensional 
submanifold of R? (verify this!) precisely because of this point of intersection. 

If the origin 0 € R? is removed from E, then the set E\0 will now obviously 
satisfy Definition 1. We remark that the set E\0 is not connected. It consists of four 
pairwise disjoint rays. 

Thus a k-dimensional surface in R” satisfying Definition | may happen to be a 
disconnected subset consisting of several connected components (and these compo- 
nents are connected k-dimensional surfaces). A surface in R” is often taken to mean 
a connected k-dimensional surface. Just now we shall be interested in the problem 
of finding extrema of functions defined on surfaces. These are local problems, and 
therefore connectivity will not manifest itself in them. 


Example 8 If a smooth mapping f : G > R” of the domain G C R” defined in 
coordinate form by (8.136) has rank k at the point fo € G, then there exists a neigh- 
borhood U (to) C G of this point whose image f(U(to)) C R” is a smooth surface 
in R”. 

Indeed, as already noted above, in this case relations (8.136) can be replaced by 
the equivalent system 


xh = ght(x1, xk), 
; (8.139) 
x” =x", ean) 

in some neighborhood U (fo) of to € G. (For simplicity of notation, we assume that 

the system f!,..., f* has rank k.) Setting 


PG Sa Sao ease") (i=1,...,n—k), 


we write the system (8.139) in the form (8.137). Since relations (8.138) are satis- 
fied, Example 6 guarantees that the set f(U (to)) is indeed a k-dimensional smooth 
surface in R”. 


8.7.2 The Tangent Space 


In studying the law of motion x = x(t) of a point mass in R°, starting from the 
relation 


x(t)=x(0)+x/(0)t +o(t) ast>0 (8.140) 


and assuming that the point ¢ = 0 is not a critical point of the mapping R 3 tf 
x(t) € R3, that is, x’(0) 4 0, we defined the line tangent to the trajectory at the point 
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x(0) as the linear subset of R? given in parametric form by the equation 

x— x9 =x'(O)t (8.141) 
or the equation 

x—xo=€-T, (8.142) 


where xo = x(0) and = x’(0) is a direction vector of the line. 

In essence, we did a similar thing in defining the tangent plane to the graph 
of a function z = f(x, y) in R°. Indeed, supplementing the relation z = f (x, y) 
with the trivial equalities x = x and y = y, we obtain a mapping R? 5 (x, y) 
(x, y, f(x, y)) € R? to which the tangent at the point (xo, yo) is the linear mapping 


x — x0 1 0 a 
y-yo]= 0 1 ( 7 ), (8.143) 
z= 2 fc (%0, Yo) fo, Yo) eae 


where zo = f (x0, yo). 

Setting t = (x — x9, y — yo) and x = (x — x0, y — Yo, Z — Zg) here, and denoting 
the Jacobi matrix in (8.143) for this transformation by x’(0), we remark that its rank 
is two and that in this notation relation (8.143) has the form (8.141). 

The peculiarity of relation (8.143) is that only the last equality in the set of three 
equalities 


x—x9=xX—X0, 
y-yo=y—yo, (8.144) 
z— 20 = fy (x0, Yo)(x — x0) + fo, y — O)(Y — Yo), 


to which it is equivalent is a nontrivial relation. That is precisely the reason it is 
retained as the equation defining the plane tangent to the graph of z = f(x, y) at 
(x0, YO, ZO). 

This observation can now be used to give the definition of the k-dimensional 
plane tangent to a k-dimensional smooth surface S C R”. 

It can be seen from Definition | of a surface that in a neighborhood of each of 
its points xo € S a k-dimensional surface S can be defined parametrically, that is, 
using mappings I‘ 5 (t!,...,t%) + (!,..., x”) € S. Such a parametrization can 
be taken to be the restriction of the mapping y~! : J” — U(xo) to the k-dimensional 
plane r*+! =... = +” =0 (see Fig. 8.10). 

Since g7! is a diffeomorphism, the Jacobian of the mapping g~! : I” > U(xo) 
is nonzero at each point of the cube J”. But then the mapping I‘ 3 (t!,..., 1%) 1b 
(x!,...,x”) € S obtained by restricting y~! to this plane must also have rank k at 
each point of /*. 

Now setting (t!,...,t*) =t € * and denoting the mapping I‘ 3 tt x € S by 
x = x(t), we obtain a local parametric representation of the surface S possessing 
the property expressed by (8.140), on the basis of which we take Eq. (8.141) as the 
equation of the tangent space or tangent plane to the surface S C R” at x9 € S. 

Thus we adopt the following definition. 
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Definition 2 If a k-dimensional surface S C R”, 1 < k <n, is defined parametri- 
cally in a neighborhood of x9 € S by means of a smooth mapping (t!,...,t*) = 
tr x =(x!,...,x”) such that x9 = x(0) and the matrix x’(0) has rank k, then the 
k-dimensional surface in IR” defined parametrically by the matrix equality (8.141) 
is called the tangent plane or tangent space to the surface S at xq € S. 


In coordinate form the following system of equations corresponds to Eq. (8.141): 


ee Oca ee 
ie ae Ts ark : 
: (8.145) 


ax” ax” 
x= xp = TON +--+ 5g OF. 


We shall denote!” the tangent space to the surface S at x € S, as before, by TSy. 

An important and useful exercise, which the reader can do independently, is to 
prove the invariance of the definition of the tangent space and the verification that 
the linear mapping f +> x’(0)r tangent to the mapping t +> x(t), which defines the 
surface S locally, provides a mapping of the space R* = TRE onto the plane TS, 0) 
(see Problem 3 at the end of this section). 

Let us now determine the form of the equation of the tangent plane to the k- 
dimensional surface S$ defined in R” by the system (8.137). For definiteness we 
shall assume that condition (8.138) holds in a neighborhood of the point xg € S. 

Setting (x!,..., x") =u, (x*+!,...,x") =, (F!,..., F"—») = F, we write the 
system (8.137) in the form 


F(u, v) =0, (8.146) 
and (8.138) as 
det F;(u, v) £0. (8.147) 
Using the implicit function theorem, in a neighborhood of the point (uo, vo) = 
as ees i ; ae ceed xo) we pass from relation (8.146) to the equivalent relation 


v= f(u), (8.148) 


which, when we supplement it with the identity uw = u, yields the parametric repre- 
sentation of the surface S in a neighborhood of xg € S: 


u=uU, 


8.149 
v= f(u). ee) 


‘0This is a slight departure from the usual notation Ty S or T;(S). 
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On the basis of Definition 2 we obtain from (8.149) the parametric equation 


u—ug=E-t, 
(8.150) 
v—vo = f"(uo)-t 


of the tangent plane; here E is the identity matrix and t = u — uo. 
Just as was done in the case of the system (8.144), we retain in the system (8.150) 
only the nontrivial equation 


v— v9 = f'(uo)(u — uo), (8.151) 


k 


which contains the connection of the variables x!,...,x* with the variables 


x*+!__., x" that determine the tangent space. 
Using the relation 


=i 
f' (uo) = —[F) uo, vo) | [Fi (uo, v0) ], 
which follows from the implicit function theorem, we rewrite (8.151) as 
F/(uo, vo) (u — uo) + F/(uo, vo)(v — vo) = 0, 


from which, after returning to the variables (x h ..., 4") = x, we obtain the equation 
we are seeking for the tangent space TS,, C R", namely 


F.(x0)(x — x9) = 0. (8.152) 


In coordinate representation Eq. (8.152) is equivalent to the system of equations 


aF! oF! 
Far Hole! — x9) +.+ 5aw (0)(x" — x9) =0, 
: (8.153) 
n—k apn-k 
(xo)(x! — xg) Fee + (xo) (x" — xp) =0. 


ax} axt 


By hypothesis the rank of this system is n — k, and hence it defines a k- 
dimensional plane in R”. 

The affine equation (8.152) is equivalent (given the point xo) to the vector equa- 
tion 


F’ (xo) -€ =0, (8.154) 


in which € = x — xo. 

Hence the vector & lies in the plane TS,, tangent at xo € S to the surface S C R” 
defined by the equation F(x) = 0 if and only if it satisfies condition (8.154). Thus 
TSx, can be regarded as the vector space consisting of the vectors € that satisfy 
(8.154). 

It is this fact that motivates the use of the term tangent space. 

Let us now prove the following proposition, which we have already encountered 
in a special case (see Sect. 6.4). 
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Proposition The space TS;, tangent to a smooth surface S C R" at a point xp € S 
consists of the vectors tangent to smooth curves lying on the surface S and passing 
through the point xo. 


Proof Let the surface S be defined in a neighborhood of the point xo € S by asystem 
of equations (8.137), which we write briefly as 


F(x) =0, (8.155) 


where F = (F!,..., F"~*), x = (x!,...,x"). Let 7: I > S be an arbitrary 
smooth path with support on S. Taking J = {t € R | |t| < 1}, we shall assume that 
x(0) = xo. Since x(t) € S for t € J, after substituting x(t) into Eq. (8.155), we 
obtain 


F(x(t)) =0 (8.156) 


for t € J. Differentiating this identity with respect to t, we find that 
F’. (x(t)) -x/(t) =0. 
In particular, when t = 0, setting € = x’(0), we obtain 
Fi(xo)é = 0, 


that is, the vector € tangent to the trajectory at xo (at time t = 0) satisfies Eq. (8.154) 
of the tangent space TS,,. 

We now show that for every vector & satisfying Eq. (8.154) there exists a smooth 
path I”: J — S that defines a curve on S passing through xo at t = 0 and having the 
velocity vector & at time t = 0. 

This will simultaneously establish the existence of smooth curves on S passing 
through x9, which we assumed implicitly in the proof of the first part of the propo- 
sition. 

Suppose for definiteness that condition (8.138) holds. Then, knowing the first 
k coordinates €!,...,é* of the vector € = (€!,...,&*,e4*!,...,&"), we de- 
termine the other coordinates €**+!,...,&" uniquely from Eq. (8.154) (which 
is equivalent to the system (8.153)). Thus, if we establish that a vector é = 
(Bou eter, ‘ pt) satisfies Eq. (8.154), we can conclude that é =&. We 
shall make use of this fact. 

Again, as was done above, we introduce for convenience the notation u = 
(xt), v= the), x = (x! x”) = (u,v), and F(x) = F(u, v). 
Then Eq. (8.155) will have the form (8.146) and condition (8.138) will have the 
form (8.147). In the subspace R* C R” of the variables x!,...,x* we choose a 
parametrically defined line 

xi a, =é!4, 
teR, 


k k k 
x" — x5 = Et, 
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having direction vector (€ as é k) which we denote &,. In more abbreviated no- 
tation this line can be written as 


“= Ug + yt. (8.157) 


Solving Eq. (8.146) for v, by the implicit function theorem we obtain a smooth 
function (8.148), which, when the right-hand side of Eq. (8.157) is substituted as its 
argument and (8.157) is taken account of, yields a smooth curve in R” defined as 
follows: 


u=uot éut, 


gates. oO (8.158) 


Since F(u, f (u)) = 0, this curve obviously lies on the surface S. Moreover, it is 
clear from Eqs. (8.158) that at t = 0 the curve passes through the point (ug, v9) = 
C) eee ae ee 9 ea S. 


Differentiating the identity 
F (u(t), v(t)) = F(uo + ut, fo + Eut)) =0 
with respect to t, we obtain for t = 0 
Fy, (uo, vou + Fy (uo, v0)&v = 0, 


where &, = v'(0) = (é*+!,...,&"). This equality shows that the vector — = 
(&,, €) = (E!,...,&*, E!, ..., E”) satisfies Eq. (8.154). Thus by the remark made 
above, we conclude that =. But the vector is the velocity vector at t = 0 for 
the trajectory (8.158). The proposition is now proved. 


8.7.3 Extrema with Constraint 


a. Statement of the Problem 


One of the most brilliant and well-known achievements of differential calculus is the 
collection of recipes it provides for finding the extrema of functions. The necessary 
conditions and sufficient differential tests for an extremum that we obtained from 
Taylor’s theorem apply, as we have noted, to interior extrema. 

In other words, these results are applicable only to the study of the behavior 
of functions R” 5 x + f(x) € R in a neighborhood of a point x9 € IR”, when the 
argument x can assume any value in some neighborhood of x9 in R”. 

Frequently a situation that is more complicated and from the practical point of 
view even more interesting arises, in which one seeks an extremum of a function un- 
der certain constraints that limit the domain of variation of the argument. A typical 
example is the isoperimetric problem, in which we seek a body of maximal volume 
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subject to the condition that its boundary surface has a fixed area. To obtain a mathe- 
matical expression for such a problem that will be accessible to us, we shall simplify 
the statement and assume that the problem is to choose from the set of rectangles 
having a fixed perimeter 2p the one having the largest area o. Denoting the lengths 
of the sides of the rectangle by x and y, we write 


a(x, h)=x-y, 
X+y=p. 


Thus we need to find an extremum of the function o (x, y) under the condition 
that the variables x and y are connected by the equation x + y = p. Therefore, the 
extremum is being sought only on the set of points of R? satisfying this relation. 
This particular problem, of course, can be solved without difficulty: it suffices to 
write y = p — x and substitute this expression into the formula for o(x, y), then 
find the maximum of the function x(p — x) by the usual methods. We needed this 
example only to explain the statement of the problem itself. 

In general the problem of an extremum with constraint usually amounts to finding 
an extremum for a real-valued function 


VST ie) (8.159) 


of n variables under the condition that these variables must satisfy a system of equa- 
tions 


: (8.160) 
cas ee a | =0. 


Since we are planning to obtain differential conditions for an extremum, we shall 
assume that all these functions are differentiable and even continuously differen- 
tiable. If the rank of the system of functions F ae , F™ isn —k, conditions (8.160) 
define a k-dimensional smooth surface S in R”, and from the geometric point of 
view the problem of extremum with constraint amounts to finding an extremum of 
the function f on the surface S. More precisely, we are considering the restriction 
J \s of the function f to the surface S and seeking an extremum of that function. 

The meaning of the concept of a local extremum itself here, of course, remains 
the same as before, that is, a point x9 € S is a local extremum of f on S, or, more 
briefly f|s, if there is a neighborhood!! Us(xo) of xo in § CR” such that f(x) > 
f (xo) for any point x € Us(xo) (in which case xo is a local minimum) or f(x) < 
f (xo) (and then xo is a local maximum). If these inequalities are strict for x € 
Us(xo)\xo, then the extremum, as before, will be called strict. 


'lWe recall that Us (xo) = SM U(x), where U(x) is a neighborhood of xo in R”. 
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b. A Necessary Condition for an Extremum with Constraint 


Theorem 1 Let f : D— R be a function defined on an open set D C R" and be- 
longing to C“) (D; R). Let S be a smooth surface in D. 

A necessary condition for a point xo € S that is noncritical for f to be a local 
extremum of f |5 is that 


TS, c IN, (8.161) 


where TS,, is the tangent space to the surface S at xp and TN, is the tangent space 
to the level surface N = {x € D| f(x) = f (xo)} of f to which xo belongs. 


We begin by remarking that the requirement that the point x9 be non-critical for 
f is not an essential restriction in the context of the problem of finding an extremum 
with constraint, which we are discussing. Indeed, even if the point x9 € D were a 
critical point of the function f : D > R or an extremum of the function, it is clear 
that it would still be a possible or actual extremum respectively for the function fs. 
Thus, the new element in this problem is precisely that the function f|s may have 
critical points and extrema that are different from those of /. 


Proof We choose an arbitrary vector € € TS,, and a smooth path x = x(t) on S that 
passes through this point at t = 0 and for which the vector & is the velocity at t = 0, 
that is, 


ic 8.162 
a (8.162) 


If xo is an extremum of the function f|s5, the smooth function f(x (t)) must have 
an extremum at tf = 0. By the necessary condition for an extremum, its derivative 
must vanish at t = 0, that is, we must have 


f' (xo) -& =0, (8.163) 


where 


a a 
fsa) =( f Ba), é=(é!,...,8"). 


ax!’ ox” 


Since xg is a noncritical point of f, condition (8.163) is equivalent to the condi- 
tion that € ¢ TN x,, for relation (8.163) is precisely the equation of the tangent space 
TN xy- 

Thus we have proved that TS, C TN xo. 


If the surface S is defined by the system of equations (8.160) in a neighborhood 
of xo, then the space TS,,, as we know, is defined by the system of linear equa- 
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tions 
OF! aF! 
pet COE! +--+ + a Gods" =O, 
: (8.164) 
m of” 
aa COE! +--+ (x0) 6" = 0. 
The space TN ,, is defined by the equation 
0 0 
oF (ao)8! sp ee=p or (x0)&" = 0, (8.165) 


and, since every solution of (8.164) is a solution of (8.165), the latter equation is a 
consequence of (8.163). 

It follows from these considerations that the relation TS;, C TN x9 is equivalent 
to the analytic statement that the vector grad f(x) is a linear combination of the 
vectors grad F'(xo) G =1,...,m), that is, 


m 


grad f (xo) = ) >A; grad F! (xo). (8.166) 


i=1 


Taking account of this way of writing the necessary condition for an extremum of 
a function (8.159) whose variables are connected by (8.160), Lagrange proposed 
using the following auxiliary function when seeking a constrained extremum: 


L(x, A) = f(x) — oa F' (a) (8.167) 


i=l 


inn +m variables (x, A) = (x!,...,.x,Aq,..-, An): 

This function is called the Lagrange function and the method of using it is the 
method of Lagrange multipliers. 

The function (8.167) is convenient because the necessary conditions for an ex- 
tremum of it, regarded as a function of (x, A) = (x!, 22, X’",A1,...,An), are pre- 
cisely (8.166) and (8.160). 

Indeed, 


aL of = Fi 
agi OM = 57) dig 7) =0 Slum), 
= (8.168) 


ee d) = —F'(x) =0 i= 1 
a J=-F' x)= (i=1,...,m). 


Thus, in seeking an extremum of a function (8.159) whose variables are sub- 
ject to the constraints (8.160), one can write the Lagrange function (8.167) with 
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undetermined multipliers and look for its critical points. If it is possible to find 
x= Cae ...,X9) from the system (8.168) without finding 4 = (A, ..., Am), then, 
as far as the original problem is concerned, that is what should be done. 

As can be seen from (8.166), the multipliers 4; @ = 1,...,m) are uniquely de- 
termined if the vectors grad F ‘(x9) (i =1,...,m) are linearly independent. The 
independence of these vectors is equivalent to the statement that the rank of the sys- 
tem (8.164) is m, that is, that all the equations in this system are essential (none of 
them is a consequence of the others). 

This is usually the case, since it is assumed that all the relations (8.160) are 
independent, and the rank of the system of functions F!,..., F’”” is m at every 
point x eX. 

The Lagrange function is often written as 


m 


L(x, A) = f(x) + ai F' (x), 
i=l 
which differs from the preceding expression only in the inessential replacement of 
Aji by =e 


Example 9 Let us find the extrema of a symmetric quadratic form 


fa)= a ajjx'x/ (aij = ji) (8.169) 


ga! 


on the sphere 


n 


F(x) => (x')’-1=0. (8.170) 


i=1 
Let us write the Lagrange function for this problem 
n n 
L(x, A) = a ax a (doi - . 
i,j=l i=1 


and the necessary conditions for an extremum of L(x, A), taking account of the 
relation aj; = aji: 


aL ee 
at A) = 2 aya! — 10 |= 0. GH 1,..557), 


j=l 


OL n F 
Fon=-(Yt i ) nt, 


i=1 


(8.171) 


'2Tn regard to the necessary criterion for an extremum with constraint, see also Problem 6 in 
Sect. 10.7 (Part 2). 
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Multiplying the first equation by x’ and summing the first relation over i, we 
find, taking account of the second relation, that the equality 


n 
Yo ajjxixi —2=0 (8.172) 
i,j=!1 


must hold at an extremum. 
The system (8.171) minus the last equation can be rewritten as 


n 
Yiaijxi sax! G=1,...,n), (8.173) 
i=l 


from which it follows that A is an eigenvalue of the linear operator A defined by the 
matrix (a;;), and x = (x!,...,x”) is an eigenvector of this operator corresponding 
to this eigenvalue. 

Since the function (8.169), which is continuous on the compact set S = {x € R” | 
ai (x!)? = 1}, must assume its maximal value at some point, the system (8.171), 
and hence also (8.173), must have a solution. Thus we have established along the 
way that every real symmetric matrix (a;;) has at least one real eigenvalue. This 
is a result well-known from linear algebra, and is fundamental in the proof of the 
existence of a basis of eigenvectors for a symmetric operator. 

To show the geometric meaning of the eigenvalue A, we remark that if 7 > 0, 
then, passing to the coordinates f= rare we find, instead of (8.172), 


> ajjtit? =1, (8.174) 


and, instead of (8.170), 
ryas—. 8.175 
Ley =; (8.175) 


But yo 5 (t')? is the square of the distance from origin to the point t = 
(t!,...,f”), which lies on the quadric surface (8.174). Thus if (8.174) represents, 
for example, an ellipsoid, then the reciprocal 1/4 of the eigenvalue A is the square 
of the length of one of its semi-axes. 

This is a useful observation. It shows in particular that relations (8.171), which 
are necessary conditions for an extremum with constraint, are still not sufficient. 
After all, an ellipsoid in IR? has, besides its largest and smallest semi-axes, a third 
semi-axis whose length is intermediate between the two, in any neighborhood of 
whose endpoint there are both points nearer to the origin and points farther away 
from the origin than the endpoint. This last becomes completely obvious if we con- 
sider the ellipses obtained by taking a section of the original ellipsoid by two planes 
passing through the intermediate-length semi-axis, one passing through the smallest 
semi-axis and the other through the largest. In one of these cases the intermediate 
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axis will be the major semi-axis of the ellipse of intersection. In the other it will be 
the minor semi-axis. 

To what has just been said we should add that if 1/./A is the length of this inter- 
mediate semi-axis, then, as can be seen from the canonical equation of an ellipsoid, 
A will be an eigenvalue of the operator A. Therefore the system (8.171), which ex- 
presses the necessary conditions for an extremum of the function f|s5, will indeed 
have a solution that does not give an extremum of the function. 

The result obtained in Theorem | (the necessary condition for an extremum with 
constraint) is illustrated in Fig. 8.1 1a and 8.1 1b. 

Figure 8.1 1a explains why the point xo of the surface S cannot be an extremum 
of f|s if S is not tangent to the surface N = {x € R” | f(x) = f (xo) = co} at xo. It 
is assumed here that grad f (xo) 4 0. This last condition guarantees that in a neigh- 
borhood of xg there are points of a higher, c2-level surface of the function f and 
also points of a lower, c-level surface of the function. 

Since the smooth surface S intersects the surface N, that is, the co-level surface 
of the smooth function f, it follows that $ will intersect both higher and lower level 
surfaces of f in a neighborhood of xo. But this means that x9 cannot be an extremum 
of fls. 

Figure 8.1 1b shows why, when N is tangent to S at xo, this point may turn out to 
be an extremum. In the figure xo is a local maximum of /|s. 

These same considerations make it possible to sketch a picture whose analytic 
expression can show that the necessary criterion for an extremum is not sufficient. 

Indeed, in accordance with Fig. 8.12, we set, for example, 


fa y=y, F(x,y)=x°—y=0. 


It is then obvious that y has no extremum at the point (0,0) on the curve S$ C 
IR? defined by the equation y = x°, even though this curve is tangent to the level 
line f(x, y) =0 of the function f at that point. We remark that grad f(0,0) = 


(0,1) 40. 
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It is obvious that this is essentially the same example that served earlier to illus- 
trate the difference between the necessary and sufficient conditions for a classical 
interior extremum of a function. 


c. A Sufficient Condition for a Constrained Extremum 


We now prove the following sufficient condition for the presence or absence of a 
constrained extremum. 


Theorem 2 Let f : D— R be a function defined on an open set D C R" and be- 
longing to the class C®(D; R); let S be the surface in D defined by Eqs. (8.160), 
where Fi € C2 (D; R) @ =1,...,m) and the rank of the system of functions 
{F!,..., F"} at each point of D ism. 

Suppose that the parameters i, ..., Am in the Lagrange function 


ELM SaLGE a7 (x teat) = > A eee") 


i=1 


have been chosen in accordance with the necessary criterion (8.166) for a con- 
strained extremum of the function f\s at xo € S.' 
A sufficient condition for the point xo to be an extremum of the function f |g is 
that the quadratic form 
aE Cygigi (8.176) 
axtaxi ©? ; 
be either positive-definite or negative-definite for vectors § € TS xo. 
If the quadratic form (8.176) is positive-definite on TSx,, then xq is a strict local 
minimum of f \s; if it is negative-definite, then xo is a strict local maximum. 
A sufficient condition for the point xq not to be an extremum of f\s is that the 
form (8.176) assume both positive and negative values on TS xq. 


Proof We first note that L(x) = f(x) for x € S, so that if we show that xo € S is 
an extremum of the function L|s, we shall have shown simultaneously that it is an 
extremum of fs. 

By hypothesis, the necessary criterion (8.166) for an extremum of f|s at xo is 
fulfilled, so that grad L (xo) = 0 at this point. Hence the Taylor expansion of L(x) in 
a neighborhood of xo = (xd, ae xo) has the form 


1 0L 2 ee 
-(xo)(x! — x4) (x4 — x9) + o(llx — xoll?) (8.177) 


L(x) — LQ) = 5 ral 


as xX > Xo. 


'3When we keep A fixed, L(x; A) becomes a function depending only on x; we allow ourselves to 
denote this function L(x). 
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We now recall that, in motivating Definition 2, we noted the possibility of a local 
(for example, in a neighborhood of xo € S) parametric definition of a smooth k- 
dimensional surface S (in the present case, k =n — m). 

In other words, there exists a smooth mapping 


Ra(e’,...,e) =tex=(x',...,2") ER" 


(as before, we shall write it in the form x = x(t)), under which a neighborhood of 
the point 0 = (0,...,0) € R* maps bijectively to some neighborhood of xo on S, 
and x9 = x(0). 

We remark that the relation 


x(t) —x(0) =x/(O)t + o(lt|]) ast > 0, 
which expresses the differentiability of the mapping ft +> x(t) at t = 0, is equivalent 


to the coordinate equalities 


i i Ox! ya . 
x'(t)—x'(0) = ae +o(IItl) @=1,...,n), (8.178) 


in which the index @ ranges over the integers from | to k and the summation is over 
this index. 
It follows from these numerical equalities that 


|x! (tf) — x'(0)| = O((Itl|) ast > 0 
and hence 
x(t) — x) on = O(ltllge) ast > 0. (8.179) 
Using relations (8.178), (8.179), and (8.177), we find that as t > 0 


1 ; 
L(x(t)) - L(x (0)) = 5 bi EO) Bax! (0)dgx(O)t*r? + o( tI”). (8.177') 
Hence under the assumption of positive- or negative-definiteness of the form 
jj L (x0) Bax! (0)dg.x/ (O)t%t? (8.180) 


it follows that the function L(x(t)) has an extremum at t = 0. If the form (8.180) 
assumes both positive and negative values, then L(x(t)) has no extremum at ¢ = 0. 
But, since some neighborhood of the point 0 € R* maps to a neighborhood of x (0) = 
xo € S on the surface S under the mapping t + x(t), we can conclude that the 
function L|s5 also will either have an extremum at x9 of the same nature as the 
function L(x(t)) or, like L(x(t)), will not have an extremum. 

Thus, it remains to verify that for vectors € € TS;, the expressions (8.176) and 
(8.180) are merely different notations for the same object. 

Indeed, setting 


é=x'(O)t, 
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we obtain a vector € tangent to S$ at xo, and if € = CO ang h ly x(t) = 
(xt, ...,x")(), andt =(t!,..., 7%), then 


&=dgx/(O)t® (j=1,...,n), 


from which it follows that the quantities (8.176) and (8.180) are the same. 


We note that the practical use of Theorem 2 is hindered by the fact that only 
k =n—m Of the coordinates of the vector € = (€!,...,€”) € TS ‘xq are independent, 
since the coordinates of € must satisfy the system (8.164) defining the space TS,,. 
Thus a direct application of the Sylvester criterion to the quadratic form (8.176) 
generally yields nothing in the present case: the form (8.176) may not be positive- 
or negative-definite on TR‘, and yet be definite on TS,,. But if we express m coor- 
dinates of the vector € in terms of the other k coordinates by relations (8.164) and 
then substitute the resulting linear forms into (8.176), we arrive at a quadratic form 
in k variables whose positive- or negative-definiteness can be investigated using the 
Sylvester criterion. 


Let us clarify what has just been said by some elementary examples. 


Example 10 Suppose we are given the function 
f@yDar—-y~ +7 


in the space R? with coordinates x, y, z. We seek an extremum of this function on 
the plane S defined by the equation 


F(x, y,z)=2x —y—3=0. 
Writing the Lagrange function 
L(x, y, 2) = (x? — y? +27) —A(2x — y—-3) 


and the necessary conditions for an extremum 


OL 

— =2x —22=0, 

Ox 

7) 

— =-—2y+A=0, 

dy 

OL 

—W-?2 =U, 

0z : 

oe (2 3)=0 
aA x —y =U, 


we find the possible extremum p = (2, 1,0). 
Next we find the form (8.176): 


1 a 
5 oj LENE! = (e!)’ — 67)" + (@)7. (8.181) 
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We note that in this case the parameter A did not occur in this quadratic form, 
and so we did not compute it. 
We now write the condition € € TS): 


of = 2? = 0, (8.182) 


From this equality we find £7 = 2é! and substitute it into the form (8.181), after 
which it assumes the form 


2 2 
3) + EY’, 
where this time €! and £3 are independent variables. 


This last form may obviously assume both positive and negative values, and 
therefore the function f|5 has no extremum at p € S. 


Example 11 Under the hypotheses of Example 10 we replace R* by R? and the 
function f by 


faye sy, 
retaining the condition 
2x —y—3=0, 


which now defines a line S in the plane R?. 
We find p = (2, 1) as a possible extremum. 
Instead of the form (8.181) we obtain the form 


(g1)? — (2)? (8.183) 


with the previous relation (8.182) between &! and é?. 
Thus the form (8.183) now has the form 


—3(E!)° 


on TS p, that is, it is negative-definite. We conclude from this that the point p = (2, 1) 
is a local maximum of fs. 

The following simple examples are instructive in many respects. On them one 
can distinctly trace the working of both the necessary and the sufficient conditions 
for constrained extrema, including the role of the parameter and the informal role of 
the Lagrange function itself. 


Example 12 On the plane R* with Cartesian coordinates (x, y) we are given the 
function 


fx,y=x? ty’, 


8.7 Surfaces in R” and Constrained Extrema 537 


Let us find the extremum of this function on the ellipse given by the canonical 
relation 
2 


Ce ee 


where 0<a <b. 

It is obvious from geometric considerations that min f|s = a* and max f|s = b?. 
Let us obtain this result on the basis of the procedures recommended by Theorems 1 
and 2. 

By writing the Lagrange function 


2 2 
XxX 
L(x, y, A) = (x2 +?) -(5+ ie 1) 
a 


and solving the equation dZ = 0, that is, the system or = 4L — 4. — 0), we find the 


dy ~ dA 
solutions 
(x,y, A) =(ta,0,a*), (0, £b, b’). 


Now in accordance with Theorem 2 we write and study the quadratic form 
5a°Lé >| the second term of the Taylor expansion of the Lagrange function in a 
neighborhood of the corresponding points: 


rere? =(1-4)ers (1A) ery. 


At the points (+a, 0) of the ellipse S the tangent vector € = (€ - é *) has the form 
(0,é 2). and for 4 = a? the quadratic form assumes the form 


Taking account of the condition 0 < a < b, we conclude that this form is positive- 
definite and hence at the points (+a, 0) € S there is a strict local (and in this case 
obviously also global) minimum of the function f|s, that is, min f|s5 =a’. 


Similarly we find the form 
BY 9.452 
(ae 


which corresponds to the points (0, +b) € S, and we find max f|s5 = b*. 


Remark Note the role of the Lagrange function here compared with the role of the 
function f. At the corresponding points on these tangent vectors the differential of f 
(like the differential of L) vanishes, and the quadratic form 5 d* fé 2 = (€!)2 4 (€)? 
is positive definite at whichever of these points it is computed. Nevertheless, the 
function f|s5 has a strict minimum at the points (a, 0) and a strict maximum at the 
points (0, +b). 
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To understand what is going on here, look again at the proof of Theorem 2 and try 
to obtain relation (8.176) by substituting f for L in (8.177). Note that an additional 
term containing x” (0) arises here. The reason it does not vanish is that, in contrast 
to dL the differential df of f is not identically zero at the corresponding points, 
even though its values are indeed zero on the tangent vectors (of the form x’(0)). 


Example 13 Let us find the extrema of the function 
f@y2=r+y7 $2 


on the ellipsoid S defined by the relation 


where O<a<b<c. 
By writing the Lagrange function 


ae x2 y? 7 
Lx, y,zA)= (xi +y° +2") (5 tata i) 
in accordance with the necessary criterion for an extremum, we find the solutions of 


: _ : aL _ dL _ dL __ aL : 
the equation dL = 0, that is, the system = 0: 


(x, y,z,A) =(+a,0,0,a7), (0,4b,0,b’), (0,0, +c, c*). 


On each respective tangent plane the quadratic form 


5 Pe? = (1 _ * te) 4 (: = =) («)? + (1 - =)ey 


in each of these cases has the form 
(1 = =e) + (1 - Sey. (a) 
(1 - =e? + (1 - 7 Yer. (b) 
(1 - S)ey + (1 - =) (c) 


Since 0 < a < b <c, it follows from Theorem 2, which gives a sufficient criterion 
for the presence or absence of a constrained extremum, that one can conclude that 
in cases (a) and (c), we have found respectively min f|5 = a” and max f\ls= c, 
while at the points (0, +b, 0) € S corresponding to case (b) the function f|s5 has no 
extremum. This is in complete agreement with the obvious geometric considerations 
stated in the discussion of the necessary criterion for a constrained extremum. 
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Certain other aspects of the concepts of analysis and geometry encountered in 
this section, which are sometimes quite useful, including the physical interpretation 
of the problem of a constrained extremum itself, as well as the necessary criterion 
(8.166) for it as the resolution of forces at an equilibrium point and the interpretation 
of the Lagrange multipliers as the magnitude of the reaction of ideal constraints, are 
presented in the problems and exercises that follow. 


8.7.4 Problems and Exercises 


1. Paths and surfaces. 


a) Let f : J > R* be a mapping of class C“ (J; R7) of the interval J C R. 
Regarding this mapping as a path in R?, show by example that its support f (J) may 
fail to be a submanifold of R*, while the graph of this mapping in R? = R! x R? 
is always a one-dimensional submanifold of R* whose projection into R? is the 
support f (7) of the path. 

b) Solve problem a) in the case when / is an interval in R* and f ¢ CJ; R”). 
Show that in this case the graph of the mapping f : J — R” is a smooth k- 
dimensional surface in R‘ x R” whose projection on the subspace R” equals f (1). 

c) Verify that if f; :l; > S and fo: J, > S are two smooth parametrizations 
of the same k-dimensional surface § C R”, | having no critical points in J; and fo 
having no critical points in /2, then the mappings 


-1 -1 
fi of2:h—>h, fh oft: hah 
are smooth. 
2. The sphere in R". 


a) On the sphere S* = {x € R? | ||x|] = 1} exhibit a maximal domain of validity 
for the curvilinear coordinates (g, w) obtained from polar coordinates in R? (see 
formula (8.116) of the preceding section) when p = 1. 

b) Answer question a) in the case of the (m — 1)-dimensional sphere 


sm) = {x ER” | |x|] = 1} 


in R” and the coordinates (¢1, ..., @m— 1) on it obtained from polar coordinates in 
R” (see formula (8.118) of the preceding section) at p = 1. 

c) Can the sphere S* c R*+! be defined by a single coordinate system 
(t!,...,¢%), that is, a single diffeomorphism f : G > R**+! of a domain G C R*? 

d) What is the smallest number of maps needed in an atlas of the Earth’s surface? 

e) Let us measure the distance between points of the sphere S* C R? by the 
length of the shortest curve lying on the sphere S and joining these points. Such 
a curve is the arc of a suitable great circle. Can there exist a local flat map of the 
sphere such that all the distances between points of the sphere are proportional (with 
the same coefficient of proportionality) to the distances between their images on the 
map? 
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Fig. 8.13 


f) The angle between curves (whether lying on the sphere or not) at their point 
of intersection is defined as the angle between the tangents to these curves at this 
point. 

Show that there exist local flat maps of a sphere at which the angles between the 
curves on the sphere and the corresponding curves on the map are the same (see 
Fig. 8.13, which depicts the so-called stereographic projection). 


3. The tangent space. 


a) Verify by direct computation that the tangent manifold TS,, to a smooth k- 
dimensional surface S C R” at a point x9 € S is independent of the choice of the 
coordinate system in R”. 

b) Show that if a smooth surface S C D maps to a smooth surface S’ C D’ under 
a diffeomorphism f : D — D’ of the domain D C R” onto the domain D’ Cc R” and 
the point x9 € S maps to x4 € S’, then under the linear mapping f’(xo) : R" > R” 
tangent to f at x9 € D the vector space TS, maps isomorphically to the vector 
space TS 

c) If under the conditions of the preceding problem the mapping f : D — D’ is 
any mapping of class C“) (D; D’) under which f(S) C 8’, then f’(TSx)) C TS is 

d) Show that the orthogonal projection of a smooth k-dimensional surface s Cc 
R” on to the k-dimensional tangent plane TS,, to it at xo € S is one-to-one in some 
neighborhood of the point of tangency x9. 

e) Suppose, under the conditions of the preceding problem, that € € TS,, and 
él] = 1. 

The equation x — xo = &t of a line in R” lying in TS, can be used to characterize 
each point x € TS,,\xo by the pair (t, €). These are essentially polar coordinates in 
TSxp- 

Show that smooth curves on the surface S' intersecting only at the point xo cor- 
respond to the lines x — x9 = &t in a neighborhood of xo. Verify that, retaining t¢ 
as the parameter on these curves, we obtain paths along which the velocity at t = 0 
is the vector € € TS,, that determines the line x — x9 = &t, from which the given 
curve on S is obtained. 

Thus the pairs (t, €) where € € TS, ||& || = 1, and t are real numbers from some 
neighborhood U (0) of zero in R, can serve as the analogue of polar coordinates in 
a neighborhood of xo € S. 


4, Let the function F ¢ C“ (R”; R) having no critical points be such that the equa- 
tion F(x!,..., x”) =0 defines a compact surface S in R” (that is, § is compact as 
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a subset of IR”). For any point x € S we find a vector n(x) = grad F(x) normal to S$ 
at x. If we force each point x € S to move uniformly with velocity 9(x), a mapping 
Saxthx+n(x)t € R” arises. 


a) Show that for values of f sufficiently close to zero, this mapping is bijective 
and for each such value of t a smooth surface S; results from S. 

b) Let E be a set in R”; we define the 5-neighborhood of the set E to be the set 
of points in R” whose distance from F is less than 6. 

Show that for values of t close to zero, the equation 


RG ich =e 
defines a compact surface S$, C IR”, and show that the surface S; lies in the d(f)- 


neighborhood of the surface S;, where 6(t) = O(t) ast > 0. 
c) With each point x of the surface S = Sg we associate a unit normal vector 


x) 
ne GM 


and consider the new mapping SS xh x+n(x)t € R”. 

Show that for all values of ¢ sufficiently close to zero this mapping is bijective, 
that the surface S obtained from S at the specific value of ¢ is smooth, and if t) # fa, 
then Sy = AS, =. 

d) Relying on the result of the preceding exercise, show that there exists 
5 > 0 such that there is a one-to-one correspondence between the points of the 6- 
neighborhood of the surface S and the pairs (t,x), where t € ]—6, 6[C R, x € S. If 
(t!,..., t*) are local coordinates on the surface S$ in the neighborhood U, (xo) of xo, 
then the quantities (¢,t!,...,*) can serve as local coordinates in a neighborhood 
U(x) of xXQo€E R”. 

e) Show that for |t| < 6 the point x € S is the point of the surface S closest to 
(x +n(x)t) € R”. Thus for |t| < 6 the surface S; is the geometric locus of the points 
of R” at distance |t| from S. 


5. a) Let d, : S > R be the function on the smooth k-dimensional surface $ C R” 
defined by the equality d, (x) = || p — x||, where p is a fixed point of R”, x is a point 
of S, and || p — x|| is the distance between these points in R”. 

Show that at the extrema of the function d,(x) the vector p — x is orthogonal to 
the surface S. 

b) Show that on any line that intersects the surface S orthogonally at the point 
q € S, there are at most k points p such that the function d, (x) has q as a degenerate 
critical point (that is, a point at which the Hessian of the function vanishes). 

c) Show that in the case of a curve S (k = 1) in the plane IR? (n = 2) the point 
p for which the point g € S is a degenerate critical point of d,(x) is the center of 
curvature of the curve S at the point g € S. 
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6. In the plane R? with Cartesian coordinates x, y construct the level curves of the 
function f(x, y) = xy and the curve 


S={(, y) ER? |x? +y? =1}. 


Using the resulting picture, carry out a complete investigation of the problem of 
extrema of the function f|5. 
7. The following functions of class C‘°°) (IR; R) are defined on the plane R? with 
Cartesian coordinates x, y: 
2 —1/x? ome . 
f@y=x-y; Fx, y= = eee ere ne 
x“—y, ifx =0. 


a) Draw the level curves of the function f(x, y) and the curve S defined by the 
relation F(x, y) =0. 

b) Investigate the function f|s5 for extrema. 

c) Show that the condition that the form 9;; f(x0)é& ig J be positive-definite or 
negative-definite on TS,,, in contrast to the condition for the form 0;;L(x0)&'&/ on 
TSx, given in Theorem 2, is still not sufficient for the possible extremum xo € S to 
be an actual extremum of the function /|s. 

d) Check to see whether the point x9 = (0, 0) is critical for the function f and 
whether one can study the behavior of f in a neighborhood of this point using only 
the second (quadratic) term of Taylor’s formula, as was assumed in c). 

e) Consider the pair of functions F(x, y) = 2x?+ y, f(x, y) =x? +y and show 
that f may have a strict maximum at a point of the curve F(x, y) = 0, although on 
the tangent to the curve at this point the function f has a strict minimum. This again 
emphasizes the role of the function of Lagrange in the statement proved above for 
the sufficient condition for a relative extremum. 

f) Consider the pair of functions F(x, y) =x? — y?, f(x, y) = y and show that 
for L(x, y) = f(x, y) + AF the equation dL = 0 may not have solution, when the 
extremum of f is attained at a singular point of the curve F(x, y) = 0. This can be 
taken into consideration studying instead of L, the function L(x, y) = Ao f + AF, 
allowing the possibility of Ap = 0. 


8. In determining principal curvatures and principal directions in differential geom- 
etry it is useful to know how to find an extremum of a quadratic form h ijul u/ under 
the hypothesis that another (positive-definite) form g;;u'u/ is constant. Solve this 
problem by analogy with Example 9 which was discussed above. 

9. Let A= [a‘] be a square matrix of order 1 such that 


n 


Yi@y=aey G=1,....0), 


i=1 


where H|,..., H;, is a fixed set of m nonnegative real numbers. 
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a) Show that det” A can have an extremum under these conditions only if the 
rows of the matrix A are pairwise orthogonal vectors in R”. 
b) Starting from the equality 


det? A = det A - det A*, 
where A% is the transpose of A, show that under the conditions above 


max det” A = H---HMy. 


c) Prove Hadamard’s inequality for any matrix [a‘]: 


de*(a!) <]] (d310)"} 


j=1 \i=l 
d) Give an intuitive-geometric interpretation of Hadamard’s inequality. 


10. a) Draw the level surfaces of the function f and the plane S in Example 10. 
Explain the result obtained in this example on the figure. 

b) Draw the level curves of the function f and the line S in Example 11. Ex- 
plain the result obtained in this example on the figure. 


11. In Example 6 of Sect. 5.4, starting from Fermat’s principle we obtained Snell’s 
law for refraction of light at the interface of two media when the interface is a plane. 
Does this law remain valid for an arbitrary smooth interface? 

12. a) A point mass in a potential force field can be in an equilibrium position 
(also called a state of rest or a stationary state) only at critical (stationary) points 
of the potential. In this situation a position of stable equilibrium corresponds to a 
strict local minimum of the potential and an unstable equilibrium point to a local 
maximum. Verify this. 

b) To which constrained extremal problem (solved by Lagrange) does the prob- 
lem of the equilibrium position reduce for a point mass in a potential force field (for 
example, gravitational) with ideal constraints (for example, a point may be confined 
to a smooth surface or a bead may be confined to a smooth thread or a ball to a 
track)? The constraint is ideal (there is no friction); this means that its effect on the 
point (the reactive force of the constraint) is always normal to the constraint. 

c) What physical (mechanical) meaning do the expansion (8.166), the neces- 
sary criterion for a constrained extremum, and the Lagrange multiplier have in this 
case. 

Note that each of the functions of the system (8.160) can be divided by the abso- 
lute value of its gradient, which obviously leads to an equivalent system (if its rank 
is equal to m everywhere). Hence all the vectors grad F' (xq) on the right-hand side 
of (8.166) can be regarded as unit normal vectors to the corresponding surface. 

d) Do you agree that the Lagrange method of finding a constrained extremum 
becomes obvious and natural after the physical interpretation just given? 


Some Problems from the Midterm Examinations 


1 Introduction to Analysis 
(Numbers, Functions, Limits) 


Problem 1 The length of a hoop girdling the Earth at the equator is increased by 1 
meter, leaving a gap between the Earth and the hoop. Could an ant crawl through 
this gap? How big would the absolute and relative increases in the radius of the 
Earth be if the equator were lengthened by this amount? (The radius of the Earth is 
approximately 6400 km.) 


Problem 2 How are the completeness (continuity) of the real numbers, the un- 
boundedness of the series of natural numbers, and Archimedes’ principle related? 
Why is it possible to approximate every real number arbitrarily closely by ratio- 
nal numbers? Explain using the model of rational fractions (rational functions) that 
Archimedes’ principle may fail, and that in such number systems the sequence of 
natural numbers is bounded and there exist infinitely small numbers. 


Problem 3 Four bugs sitting at the corners of the unit square begin to chase one an- 
other with unit speed, each maintaining a course in the direction of the one pursued. 
Describe the trajectories of their motions. What is the length of each trajectory? 
What is the law of motion (in Cartesian or polar coordinates)? 


Problem 4 Draw a flow chart for computing ./a (a > 0) by the recursive procedure 


1 a 
Xntl = 2 Xn + %y : 


How is equation solving related to finding fixed points? How do you find </a? 


Problem 5 Let g(x) = f(x) + o(f(*)) as x > on. Is it also true that f(x) = 
g(x) + 0(g(x)) as x > co? 
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Problem 6 By the method of undetermined coefficients (or otherwise) find the first 
few (or all) coefficients of the power series for (1+x)® with a = —1, — 5 0, 5 1, 3. 
(By interpolating the coefficients of like powers of x in such expansions, Newton 
wrote out the law for forming the coefficients with any a € R. This result is known 


as Newton’s binomial theorem.) 


Problem 7 Knowing the power-series expansion of the function e*, find by the 
method of undetermined coefficients (or otherwise) the first few (or all) terms of 
the power-series expansion of the function In(1 + x). 


Problem 8 Compute exp A when A is one of the matrices 


0 0 01 0 1 0 1 0 0 
0 o}° 0 0}: 0 0 1, 0 2 0 
0 0 0 0 0 3 


Problem 9 How many terms of the series for e* must one take in order to obtain a 
polynomial that makes it possible to compute e* on the interval [-3, 5] within 10-7? 


Problem 10 If we know the power expansion of the functions sinx and cos x, find 
with the method of the undetermined coefficients (or another one) some few first 
terms (or all) the power expansion of the function tanx in a neighborhood of the 
point x = 0. 


Problem 11 The length of a band tightening the Earth at the Equator have increased 


by | meter, and after that the band was pulled propping up a vertical column. What 
is, roughly speaking, the height of the column if the radius of the Earth ~6400 km? 


1 I a 
lim (<(1 + =) ) ; 
X00 x 


Problem 13 Sketch the graphs of the following functions: 


Problem 12 Calculate 


3 


x 
logeos x SINX} b tan ———_—__.. 
a) logeog , SIN-X ) arctan d= Hada? 


2 One-Variable Differential Calculus 


Problem 1 Show that if the acceleration vector a(t) is orthogonal to the vector v(t) 
at each instant of time ¢, the magnitude |v(t)| remains constant. 


Problem 2 Let (x,t) and (X, 7) be respectively the coordinate of a moving point 
and the time in two systems of measurement. Assuming the formulas x = ax + Bt 
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and ¢ = yx + 6ét for transition from one system to the other are known, find the 


formula for the transformation of velocities, that is, the connection between v = ge 


5 de 
and v = |. 


Problem 3 The function f(x) = x? sin + for x £0, f(0) = 0 is differentiable on 
R, but f’ is discontinuous at x = 0 (verify this). We shall “prove”, however, that if 
f :R- R is differentiable on R, then f’ is continuous at every point a € R. By 
Lagrange’s theorem 


f(x) — f@ 
———— = f'), 
x-—a 
where & is a point between a and x. Then if x — a, it follows that € — a. By 
definition, 
iia EA) ar 
xa x-a 
and since this limit exists, the right-hand side of Lagrange’s formula has a limit 
equal to it. That is, f’(§) — f’(a) as € > a. The continuity of f’ at a is now 


“proved”. Where is the error? 


Problem 4 Suppose the function f has n + 1 derivatives at the point xo, and let = 
xo + 6, (x — xo) be the intermediate point in Lagrange’s formula for the remainder 
term 5, f(E)(x — x0)”, so that 0 < 6, < 1. Show that 6, > 5 as x > xo if 


n+l 
FY (x0) £0. 


Problem 5 


a) If the function f ¢ C({a, b], R) vanishes at n + 1 points of the interval 
[a, b], then there exists in this interval at least one zero of the function f ) the 
derivative of f of order n. 

b) Show that the polynomial P, (x) = 
[—1, 1]. (Hint: x2 — 1 = (@— D(x +B) and PR (-1) = P! (1) = 0, fork =0,..., 
n—1.) 


N(y2_4)\n R 
mo has n roots on the interval 


Problem 6 Recall the geometric meaning of the derivative and show that if the 
function f is defined and differentiable on an interval 7 and [a,b] C J, then the 
function f’ (not even necessarily continuous!) takes all the values between f’(a) 
and f’(b) on the interval [a, b]. 
Problem 7 Prove the inequality 


ay! ++-ah" <ayay +++ + Onan, 


where @1,...,dn, @1,...,@y are nonnegative anda, +---+a,=1. 


548 Some Problems from the Midterm Examinations 


Problem 8 Show that 


noo 


n 
lim (1+2) =e*(cosy+isiny) (¢=x-+iy), 
n 


so that it is natural to suppose that e!” = cos y +i sin y (Euler’s formula) and 
e =e*e!) =e* (cosy +isiny). 


Problem 9 Find the shape of the surface of a liquid rotating at uniform angular 
velocity in a glass. 


Problem 10 Show that the tangent to the ellipse xs + ys = | at the point (xo, yo) 


has the equation ee + com = |, and that light rays from a source situated at one 
of the foci F, = (-Va? — b?, 0), Fo = (Va? — b2, 0) of an ellipse with semiaxes 


a > b> Oare reflected by an elliptical mirror to the other focus. 


Problem 11 A particle subject to gravity, without any initial boost, begins to slide 
from the top of an iceberg of elliptic cross-section. The equation of the cross section 
is x* + Sy* = 1, y > 0. Compute the trajectory of the motion of the particle until it 
reaches the ground. 


Problem 12 The value 


ade)" 
n 


Sq(X1,X2,...,Xn) = ( 


is called the mean of order a of the numbers x1, x2,...,X,. In particular, for a = 
1,2, —1, we obtain the arithmetic mean, the mean square, and the harmonic mean, 
respectively, of these numbers. We will assume that the numbers x1, x2,..., xX, are 
nonnegative and if the exponent a is less than 0, then we will suppose even that they 
are positive. 


a) Show, using Holder’s inequality, that if a < B, then 


Sq(X1, X2, sie, »Xn) < Sp(X1, x2, ears Xn) 
and equality holds only when x) = x2 =---= Xp. 
b) Show that if aw tends to zero, then the value sy(x1,x2,...,X,) tends to 


AUX XQ*+*Xpn, 1.€., to the harmonic mean of these numbers. In view of the result 
of problem a), from here, for example, follows the classical inequality between the 
geometric and the arithmetic means of nonnegative numbers (write it down). 

c) Ifa > on, then sy (x1, Xx2,...,X,) > max{x1,X2,...,X,}, and fora > —oo, 
the value sy(x1,x2,...,X,) tends to the lowest of the considered numbers, i.e., to 
min{x,,x2,..., Xn}. Prove this. 
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Problem 13 Let r = r(t) denote the law of motion of a point (i.e., its radius vectors 
as a function of time). We suppose that it is a continuously differentiable function 
on the interval a <t <b. 


a) Is it possible, according to Lagrange’s mean value theorem, to claim that there 
exists a moment € on the interval [a, b] such that r(b) — r(a) = r'(&) - (b— a)? 
Explain your answer with examples. 

b) Let Convex{r’} be the convex hull (of all ends) of the vectors r’(t), t € [a, b]. 
Show that there exists a vector v € Convex{r’} such that r(b) — r(a) =v- (b—a). 

c) The relation |r(b) — r(a)| < sup|r’(t)| - |b — al], where the upper bound is 
taken over tf € [a, b] has an obvious physical sense. What is it? Prove this inequality 
as a general mathematical fact, developing the classical Lagrange theorem on finite 
increments. 


3 Integration and Introduction to Several Variables 


Problem 1 Knowing the inequalities of Hélder, Minkowski, and Jensen for sums, 
obtaining the corresponding inequalities for integrals. 


Problem 2 Compute the integral (i e-*” dx with a relative error of less than 10 %. 


Problem 3 The function erf(x) = ea js: et dt, called the probability error inte- 


gral, has limit 1 as x + +00. Draw the graph of this function and find its derivative. 
Show that as x + +00 


f(x) =1 Die Ss 0 S5ef 1 a de BO8 1 
erf(x) = e O : 
J 2x 22x3 -23x5 24x7 x! 
How can this asymptotic formula be extended to a series? Are there any values of 
x € R for which this series converges? 


Problem 4 Does the length of a path depend on the law of motion (the parametriza- 
tion)? 


Problem 5 You are holding one end of a rubber band of length 1 km. A beetle is 
crawling toward you from the other end, which is clamped, at a rate of 1 cm/s. 
Each time it crawls 1 cm you lengthen the band by | km. Will the beetle ever reach 
your hand? If so, approximately how much time will it require? (A problem of 
L.B. Okun’, proposed to A.D. Sakharov.) 


Problem 6 Calculate the work done in moving a mass in the gravitational field of 
the Earth and show that this work depends only on the elevation of the initial and 
terminal positions. Find the work done in escaping from the Earth’s gravitational 
field and the corresponding escape velocity. 
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Problem 7 Using the example of a pendulum and a double pendulum explain how 
it is possible to introduce local coordinate systems and neighborhoods into the set of 
corresponding configurations and how a natural topology thereby arises making it 
into the configuration space of a mechanical system. Is this space metrizable under 
these conditions? 


Problem 8 Is the unit sphere in R”, R5° or Cla, b] compact? 


Problem 9 A subset of a given set is called an e-grid if any point of the set lies at a 
distance less than ¢ from some point of the set. Denote by N (¢) the smallest possible 
number of points in an e-grid for a given set. Estimate the e-entropy log, N(e) of a 
closed line segment, a square, a cube, and a bounded region in R”. Does the quantity 
ae as as € — 0 give a picture of the dimension of the space under consideration? 


Can such a dimension be equal, for example, to 0.5? 


Problem 10 On the surface of the unit sphere S in R* the temperature T varies 
continuously as a function of a point. Must there be points on the sphere where 
the temperature reaches a minimum or a maximum? If there are points where the 
temperature assumes two given values, must there be points where it assumes inter- 
mediate values? How much of this is valid when the unit sphere is taken in the space 
C[a, b] and the temperature at the point f € S is given as 


b —1 
rin=(f flea) ? 


a) Taking 1.5 as an initial approximation to /2, carry out two iterations using 
Newton’s method and observe how many decimal places of accuracy you obtain at 
each step. 

b) By arecursive procedure find a function f satisfying the equation 


Problem 11 


fa)axt / flat. 
0 


4 Differential Calculus of Several Variables 


Problem 1 Local linearization. Consider and prove that local linearization is appli- 
cable to the following examples: instantaneous velocity and displacement; simpli- 
fication of the equation of motion when the oscillations of a pendulum are small; 
computation of linear corrections to the values of exp(A), A7!, det(E), (a, b) un- 
der small changes in the arguments (here A is an invertible matrix, E is the identity 
matrix, a and b are vectors, and (-, -) is the inner product). 
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Problem 2 


a) What is the relative error 6 = nis in computing the value of a function 
f(x, y, Z) at a point (x, y, z) whose coordinates have absolute errors Ax, Ay, and 
Az respectively? 

b) What is the relative error in computing the volume of a room whose di- 
mensions are as follows: length x = 5 + 0.05 m, width y = 4 + 0.04 m, height 
z=3+0.03 m? 

c) Is it true that the relative error of the value of a linear function coincides with 
the relative error of the value of its argument? 

d) Is it true that the differential of a linear function coincides with the function 
itself? 

e) Is it true that the relation f’ = f holds for a linear function f? 


Problem 3 


a) One of the partial derivatives of a function of two variables defined in a disk 
equals zero at every point. Does that mean that the function is independent of the 
corresponding variable in that disk? 

b) Does the answer change if the disk is replaced by an arbitrary convex region? 

c) Does the answer change if the disk is replaced by an arbitrary region? 

d) Let x = x(f) be the law of motion of a point in the plane (or in R”) in the time 
interval t € [a, b]. Let v(t) be its velocity as a function of time and C = conv{v(f) | 
t € [a, b]} the smallest convex set containing all the vectors v(t) (usually called 
the convex hull of a set that spans it). Show that there is a vector v in C such that 
x(b) — x(a) =v- (b—a). 


Problem 4 


a) Let F(x, y, z) =0. Is it true that e : ay : a = —1? Verify this for the relation 


— — 1=0 (corresponding to the Clapeyron equation of state of an ideal gas: = 


b) Now let F(x, y) = 0. Is it true that 2% 2 = 12 

c) What can you say in general about the relation F(x1,...,X,) =0? 

d) How can you find the first few terms of the Taylor expansion of the implicit 
function y = f (x) defined by an equation F(x, y) = 0 in a neighborhood of a point 
(xo, yo), knowing the first few terms of the Taylor expansion of the function F(x, y) 


in a neighborhood of (xo, yo), where F (xo, yo) = 0 and F. , (xo, yo) is invertible? 


Problem 5 


a) Verify that the plane tangent to the ellipsoid = sS rth 7 +2 a = | at the point 
(xo, Yo, Zo) can be defined by the equation “+ 230" + $= ae 


b) The point P(t) = (Bz = Py t hed from the ellipsoid = oa ett ie x + 


5s = | at time t = 1. Let p(t) i the point of the same ellipsoid anaes to P(t) at 


time ¢. Find the limiting position of p(t) as t > +00. 
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Problem 6 


a) In the plane IR? with Cartesian coordinates (x, y) construct the level curves 
of the function f(x, y) = xy and the curve S = {(x, y) € R* | x* + y* = 1}. Using 
the resulting picture, carry out a complete study of the extremal problem for f|s, 
the restriction of f to the circle S. 

b) What is the physical meaning of the Lagrange multiplier in Lagrange’s 
method of finding extrema with constraints when an equilibrium position is sought 
for a point mass in a gravitational field if the motion of the point is constrained by 
ideal relations (for example, relations of the form F(x, y, z) =0, F2(x, y, z) =0)? 


Problem 7 If in a vector space V one has a nondegenerate bilinear form B(x, y), 
then to every linear function g* € V* in this space there corresponds a unique vector 
g such that g*(v) = B(g, v), for every vector ve V. 


a) Show that if V = R”, Bx, y) = bijxix, and g*v = giv', then the vector g 
has coordinates g/ = b’/ g;, where b‘/ is the inverse of the matrix (b;;). 

A symmetric scalar product (-,-) in the Euclidean geometry or a skew-scalar 
product w(-,-) (when the form B is skew-symmetric) in the symplectic geometry 
appears most often as a bilinear form B(., -). 


I aye! 
b) Let B(vj, v2) = | a i: | be the oriented area of the parallelogram spanned by 
2.592. 


the vectors v,, v2 € R?. Find the vector g = (g!, g!) corresponding to the linear 
function g* = (gj, g2) and with respect to the form B if we know the coefficients 
of g*. 

c) The vector corresponding to the differential of a function f : R” — R at the 
point x relative to the scalar product (-,-) of the Euclidean space R” is called as 
usual the gradient of the function f at this point and denoted by grad f. Thus, 
df (x)u := (grad f, v) for every vector v € TR” ~ R”, applied at x. 

Therefore, 


of 
ax! 


0 
s (x)v" = (grad f(x), v) = | grad f(x)| -|v| cos@. 


f' (x)u = —— (x) $e + aga 


c,) Show that in the standard orthonormal basis, i.e., in Cartesian coordinates, 
grad f (x) = (46, woes PEG). 

c2) Show that the rate of growth of the function f under a motion from the 
point x with unit velocity is maximal when the direction of the movement coin- 
cides with the direction of the gradient of the function f at this point and is equal 
to | grad f(x)|. When the movement has a perpendicular direction to the vector 
grad f (x), the function does not change. 

c3) How do the coordinates of the vector grad f(x) in R* change if instead 
of considering the canonical basis (e1, e2), we take an orthogonal basis (é), é2) = 
(Aje1, Ae2)? 

1 of 


c4) How do we calculate grad f in polar coordinates? Answer: (2, ; ag)" 
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d) In exercise b) above, we considered a skew-symmetric form B(v1, v2) of the 
oriented area of the parallelogram in R?. 

Since the vector corresponding to df (x) relative to the symmetric form (-, -) 
is called the gradient grad f(x), the vector corresponding to df (x) relative to the 
skew-symmetric form B is called skew-gradient and denoted by sgrad f (x). Write 
down grad f(x) and sgrad f(x) in Cartesian coordinates. 


Problem 8 


a) Show that in R? (and in general in R7”+'), there are no nondegenerate skew- 
symmetric bilinear forms. 

b) In the oriented space R? there is a nondegenerate skew-symmetric bilinear 
form (the oriented area of the parallelogram), as we have seen. In R2” with coordi- 
nates (x!,...,x",.0.,22") = (een an ae ...,q"), such a form @ also exists: 


if Uj = (py iva DE Ges 02590") E= 1,2), then 


yh 
Py 


1 
Pi 4 
ov, 02) =] 4 feet 
P2 


1 
1 
1 
2 


That means that w(v1, v2) is the sum of the oriented areas of parallelograms 
spanned by the projections of the vectors v1, v2 in the coordinate plane (p/, q/) 
(j=1,...,n). 


b;) Let g* be a linear function in R?”, given with its coefficients g* = 
(P1,-++s Pas U1>+++> Gn). Find the coordinates for the vector g mapped by the func- 
tion g* through the form w. 

b2) The differential of the function f : R7” > R at the point x € R7” through 
the skew-symmetric form @ is associated with a vector called the skew-gradient of 
the function f, as we already said, at this point and denoted by sgrad f (x). Find the 
expression for sgrad f(x) in the canonical Cartesian coordinates in the space R2”. 

b3) Find the scalar product (grad f(x), sgrad f(x)). 

b4) Show that the vector sgrad f(x) is directed along the level surface of the 
function f. 

bs) The law of motion x = x(t) of a point x in the space R*” is such that x(t) = 
sgrad f (x(t)). Show that f(x(t)) = const. 

be) Write down the equation x = sgrad f (x) in the canonical Cartesian notation 
(p!,..., p",q',...,q") for the coordinates and H = H(p,q) for the function f. 
The resulting system, called a system of Hamilton’s equations, is one of the central 
objects of mechanics. 


Problem 9 Canonical variables and Hamilton’s system of equations. 


a) In the calculus of variations and in the fundamental variational principles of 
classical mechanics, the following system of Euler-Lagrange equations plays an 
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important role: 


oL ddL 
(= al Tae) eH=0, 


v=x(t), 


where L(t, x, v) is a given function in the variables ft, x, v, where t usually denotes 
time, x the coordinate, and v the velocity. This is a system of two equations in three 
variables. From it, one usually wants to find the dependence relations x = x(t) 
and v = v(t), which essentially boils down to finding the law of motion x = x(f), 
since v = x(t). Write down the first equation of the system in detail, revealing the 
derivative ¢ given that x = x(t) and v= v(f). 

b) Show that in transition from the variables t, x, v, L to the canonical variables 
t,x, p, H by taking the Legendre transform of 


_ OL 
ES Fe 
H=pv—-L, 


with respect to the variables v, L, interchanging them with the variables p, H, the 
Euler-Lagrange system acquires the symmetric form: 


c) In mechanics, we often use the notation g and q, instead of x and v. In many 
cases when L(t,g,g) = L(t,q',....q,q',...,q”), the Euler-Lagrange system 
of equations has the form 


oL doOL CGE ) 

eS 5q; = ~T=1,...,M). 

bg deagiy. 
Take the Legendre transform with respect to the variables g and L, and go from 
the variables t, g, g, L to the canonical variables t, g, p, H and show that this 
Euler-Lagrange system transforms into the following Hamilton equations: 


0H : 0H 
ae Hae (i=1,...,m). 
ap 


Examination Topics 


1 First Semester 


1.1 Introduction to Analysis and One-Variable Differential 
Calculus 


1. Real numbers. Bounded (from above or below) numerical sets. The axiom of 
completeness and the existence of a least upper (greatest lower) bound of a set. 
Unboundedness of the set of natural numbers. 

2. Fundamental lemmas connected with the completeness of the set of real numbers 
R (nested interval lemma, finite covering, limit point). 

3. Limit of a sequence and the Cauchy criterion for its existence. Tests for the exis- 
tence of a limit of a monotonic sequence. 

4. Infinite series and the sum of an infinite series. Geometric progressions. The 
Cauchy criterion and a necessary condition for the convergence of a series. The 
harmonic series. Absolute convergence. 

5. A test for convergence of a series of nonnegative terms. The comparison theorem. 
The series €(s) = 7 -n-*. 

6. The limit of a function. The most important filter bases. Definition of the limit 
of a function over an arbitrary base and its decoding in specific cases. Infinitesimal 
functions and their properties. Comparison of the ultimate behavior of functions, 
asymptotic formulas, and the basic operations with the symbols o(-) and O(-). 

7. The connection of passage to the limit with the algebraic operations and the order 
relation in R. The limit of “"* as x > 0. 

8. The limit of a composite function and a monotonic function. The limit of (1 + 
1)* as x > ©O. 

9. The Cauchy criterion for the existence of the limit of a function. 

10. Continuity of a function at a point. Local properties of continuous functions (lo- 
cal boundedness, conservation of sign, arithmetic operations, continuity of a com- 
posite function). Continuity of polynomials, rational functions, and trigonometric 
functions. 
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11. Global properties of continuous functions (intermediate-value theorem, max- 
ima, uniform continuity). 

12. Discontinuities of monotonic functions. The inverse function theorem. Conti- 
nuity of the inverse trigonometric functions. 

13. The law of motion, displacement over a small interval of time, the instanta- 
neous velocity vector, trajectories and their tangents. Definition of differentiability 
of a function at a point. The differential, its domain of definition and range of val- 
ues. Uniqueness of the differential. The derivative of a real-valued function of a 
real variable and its geometric meaning. Differentiability of sinx, cosx, e*, In |x|, 
and x®. 

14. Differentiability and the arithmetic operations. Differentiation of polynomials, 
rational functions, the tangent, and the cotangent. 

15. The differential of a composite function and an inverse function. Derivatives of 
the inverse trigonometric functions. 

16. Local extrema of a function. A necessary condition for an interior extremum of 
a differentiable function (Fermat’s lemma). 

17. Rolle’s theorem. The finite-increment theorems of Lagrange and Cauchy 
(mean-value theorems). 

18. Taylor’s formula with the Cauchy and Lagrange forms of the remainder. 

19. Taylor series. The Taylor expansions of e*, cos x, sinx, In(1 +x), and (1 + x)* 
(Newton’s binomial formula). 

20. The local Taylor formula (Peano form of the remainder). 

21. The connection between the type of monotonicity of a differentiable function 
and the sign of its derivative. Sufficient conditions for the presence or absence of a 
local extremum in terms of the first, second, and higher-order derivatives. 

22. L’ Hopital rule. 

23. Convex functions. Differential conditions for convexity. Location of the graph 
of a convex function relative to its tangent. 

24. The general Jensen inequality for a convex function. Convexity (or concavity) of 
the logarithm. The classical inequalities of Cauchy, Young, Hélder, and Minkowski. 
25. Legendre transform. 

26. Complex numbers in algebraic and trigonometric notation. Convergence of a 
sequence of complex numbers and a series with complex terms. The Cauchy crite- 
rion. Absolute convergence and sufficient conditions for absolute convergence of a 
series with complex terms. The limit limy—,..(1 + £)”. 

27. The disk of convergence and the radius of convergence of a power series. The 
definition of the functions e*%, cos z, sinz (z € C). Euler’s formula and the connec- 
tions among the elementary functions. 

28. Differential equations as a mathematical model of reality, examples. The 
method of undetermined coefficients and Euler’s (polygonal) method. 

29. Primitives and the basic methods of finding them (termwise integration of sums, 
integration by parts, change of variable). Primitives of the elementary functions. 
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2 Second Semester 


2.1 Integration. Multivariable Differential Calculus 


1. The Riemann integral on a closed interval. Upper and lower sums, their geo- 
metric meaning, the behavior under a refinement of the partition, and mutual esti- 
mates. Darboux’s theorem, upper and lower Darboux’s integrals and the criterion 
for Riemann integrability of real-valued functions on an interval (in terms of sums 
of oscillations). Examples of classes of integrable functions. 

2. The Lebesgue criterion for Riemann integrability of a function (statement only). 
Sets of measure zero, their general properties, examples. The space of integrable 
functions and admissible operations on integrable functions. 

3. Linearity, additivity and general evaluation of an integral. 

4. Evaluating the integral of a real-valued function. The (first) mean-value theorem. 
5. Integrals with a variable upper limit of integration, their properties. Existence 
of a primitive for a continuous function. The generalized primitive and its general 
form. 

6. The Newton—Leibniz formula. Change of variable in an integral. 

7. Integration by parts in a definite integral. Taylor’s formula with integral remain- 
der. The second mean-value theorem. 

8. Additive (oriented) interval functions and integration. The general pattern in 
which integrals arise in applications, examples: length of a path (and its indepen- 
dence of parametrization), area of a curvilinear trapezoid (area under a curve), vol- 
ume of a solid of revolution, work, energy. 

9. The Riemann-Stieltjes integral. Conditions under which it can be reduced to 
the Riemann integral. Singularities and the Dirac delta-function. The concept of a 
generalized function. 

10. The concept of an improper integral. Canonical integrals. The Cauchy criterion 
and the comparison theorem for studying the convergence of an improper integral. 
The integral test for convergence of a series. 

11. Metric spaces, examples. Open and closed subsets. Neighborhoods of a point. 
The induced metric, subspaces. Topological spaces. Neighborhoods of a point, sep- 
aration properties (the Hausdorff axiom). The induced topology on subsets. Closure 
of a set and description of relatively closed subsets. 

12. Compact sets, their topological invariance. Closedness of a compact set and 
compactness of a closed subset of a compact set. Nested compact sets. Compact 
metric spaces, ¢-grids. Criteria for a metric space to be compact and its specific 
form in R”. 

13. Complete metric spaces. Completeness of R, C, R”, C”, and the space C[a, b] 
of continuous functions under uniform convergence. 

14. Criteria for continuity of a mapping between topological spaces. Preservation 
of compactness and connectedness under a continuous mapping. The classical the- 
orems on boundedness, the maximum-value theorem, and the intermediate-value 
theorem for continuous functions. Uniform continuity on a compact metric space. 
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15. The norm (length, absolute value, modulus) of a vector in a vector space; the 
most important examples. The space L(X, Y) of continuous linear transformations 
and the norm in it. Continuity of a linear transformation and finiteness of its norm. 

16. Differentiability of a function at a point. The differential, its domain of defi- 
nition and range of values. Coordinate expression of the differential of a mapping 
f :R” > R’. The relation between differentiability, continuity, and the existence 
of partial derivatives. 

17. Differentiation of a composite function and the inverse function. Coordinate 
expression of the resulting laws in application to different cases of the mapping 
f:R” > R". 

18. Derivative along a vector and the gradient. Geometric and physical examples 
of the use of the gradient (level surfaces of functions, steepest descent, the tangent 
plane, the potential of a field, Euler’s equation for the dynamics of an ideal fluid, 
Bernoulli’s law, the work of a wing). 

19. Homogeneous functions and the Euler relation. The dimension method. 

20. The finite-increment theorem. Its geometric and physical meaning. Examples 
of applications (a sufficient condition for differentiability in terms of the partial 
derivatives; conditions for a function to be constant in a domain). 

21. Higher-order derivatives and their symmetry. 

22. Taylor’s formula. 

23. Extrema of functions (necessary and sufficient conditions for an interior ex- 
tremum). 

24. Contraction mappings. The Picard—Banach fixed-point principle. 

25. The implicit function theorem. 

26. The inverse function theorem. Curvilinear coordinates and rectification. Smooth 
k-dimensional surfaces in R” and their tangent planes. Methods of defining a surface 
and the corresponding equations of the tangent space. 

27. The rank theorem and functional dependence. 

28. Local resolution of a diffeomorphism as the composition of elementary ones 
(diffeomorphisms changing only one coordinate). 

29. Extrema with constraint (necessary condition). Geometric, algebraic, and phys- 
ical interpretation of the method of Lagrange multipliers. 

30. A sufficient condition for a constrained extremum. 


Appendix A 
Mathematical Analysis 
(Introductory Lecture) 


A.1 Two Words About Mathematics 


Mathematics is an abstract science. For example, it teaches how to count and add no 
matter of whether we count ravens, capital, or something else. Therefore, mathemat- 
ics is one of the most universal and commonly used applied sciences. Mathematics 
as a science possesses a large number of features, and therefore it is usually treated 
with respect. For example, it teaches us to listen to arguments and appreciate the 
truth. 

Lomonosov believed that mathematics leads the mind into order, and Galilei said, 
“The great book of nature is written in the language of mathematics.” The evidence 
of this is obvious: anyone who wants to read this book, must have studied math- 
ematics. Among them, there are representatives of natural sciences, technical pro- 
fessions, as well as humanities. As examples, there is the chair of mathematics at 
the Economics Faculty of Moscow State University, and also there is an Institute of 
Economical Mathematics in the system of the Russian Academy of Sciences. There 
is even a statement saying that a branch of science contains as much of science as 
there is mathematics inside of it. Although this point of view is too strong, it is in 
general a quite sharp observation. 

Mathematics has the attributes of a language. However, it is clearly not merely a 
language (otherwise, it would have been studied by philologists). Mathematics not 
only can translate a question into mathematical language, but it usually provides the 
method for solving the formulated mathematical problem. 

The capability to pose a question correctly is the great art of researchers in gen- 
eral and mathematicians in particular. 

The great mathematician Henri Poincaré, whose influence is clearly seen in most 
courses in contemporary mathematics, remarked with humor, “Mathematics is the 
art of calling different things by the same name.” For instance, a point is a barely 
visible particle under a microscope, an airplane on air traffic controls radar, a city 
on a map, a planet in the sky, and in general all those things, whose dimensions can 
be neglected on these scales. 
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Therefore, the abstract concepts of mathematics and their relationships, like num- 
ber, are very useful in a tremendous sphere of specific phenomena and consistent 
patterns. 


A.2 Number, Function, Law 


The usual reaction to a miracle, “Is it really possible???” quickly and imperceptibly 
transforms into, “It could not be otherwise!!!” 

We are so used to the fact that 2+ 3 = 5, that we don’t see any miracle there. 
But here it is not said that two apples and three apples will add up to five apples, 
it is said that this is the case for apples, elephants, and all other things. We already 
mentioned this above. 

Next, we get used to the fact that a+ b= b+-a, where now the symbols a and b 
could mean 2, 3, or any other integer. 

A function, or functional dependence, is the next mathematical miracle. It is rel- 
atively young as a scientific concept: just a little over three hundred years old. Nev- 
ertheless, we are confronted with it in nature and even in everyday life no less than 
with elephants or apples. 

Every science or every field of human activity deals with a concrete area of ob- 
jects and their relations. These relations, laws, or dependences, are described and 
studied by mathematics in an abstract and general way, relating the terms function 
or functional dependence y = f (x) of the state (value) of one variable (y) to some 
other state (or value) of another variable (x). 

It is especially important that now we are not dealing with constants, but with 
variables x and y and with the rule f relating them. A function is adapted to describe 
developing processes and phenomena, the nature of change of their states, and in 
general to describe the dependent variable. 

Sometimes the rule f of a relation is known (given), e.g., by a government or 
by a technological process. We often try then, under the constraints of the acting 
tule f, to choose a strategy, i.e., some state (value) available to our choice of the 
independent variable x, in order to obtain the most favorable state (value) for us (in 
one way or another) of the variable y (given that y = f(x)). 

In other cases (and this is even more exciting), we search for the law of nature 
relating certain phenomena. Though it is the job of experts in the corresponding 
specific branch of science, mathematics might be extremely helpful. Like Sherlock 
Holmes, it can deduce the new law f from very limited information accessible to 
the experts in this narrow domain. Following the parallel with Sherlock Holmes, 
mathematics uses a “deduction method” called differential equations, which were 
unknown to ancient mathematicians, and that emerged with the advent of differential 
and integral calculus at the seventeenth and eighteenth centuries thanks to the efforts 
of Newton, Leibniz, and their predecessors and successors. 

So, let us begin a primer of modern mathematics. 
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A.3 Mathematical Model of a Phenomenon 
(Differential Equations, or We Learn How to Write) 


One of the brightest and most long-lived impressions of school mathematics, of 
course, is that small miracle when you want to find something unknown to you, 
denoting it by the letter x or by the letters x, y, and then you write something like 
a-x =b or some system of equations 


2x+y=1, 
x—y=2. 


After a bit of mathematical hocus-pocus, you discover what was unknown to you: 
x=l,y=-l. 

Let us try to learn how to write the equation in a new situation, when we do not 
have to find a number but instead the unknown rule relating two variables that are 
important for us, i.e., we look for the requisite function. Let us take a look at some 
examples. 

In order to be more specific, we first shall talk about biology (proliferation of 
microorganisms, growth of biomass, ecological limitations, etc.). However, it will 
be clear that all this can be transferred to other areas, as for example the growth of 
capital, nuclear reactions, atmospheric pressure, and so on. 

To warm up, consider the following playful problem: 

Consider a primitive organism that replicates itself every second (doubling). Sup- 
pose it is put into an empty glass. After one minute, the glass is filled up. How long 
will it take to fill up the glass if instead of one organism, two of these organisms are 
put into an empty glass? 

Now we get closer to our goal, and the promised examples. 


Example I It is known that under favorable conditions, the reproduction rate of 
microorganisms, i.e., the biomass growth rate, is proportional (with a coefficient 
of proportionality k) to the current amount of biomass. We need to find the rule 
x = x(t) of the change of biomass over time if the initial condition x(0) = xo is 
known. 

We expect that if we knew the rule itself x = x(t) of the variable x, we would 
know its rate of change at any time ¢. Without going too far into the discussion about 
how we calculate this rate of change from x(t), we shall denote it by x’(t). Since 
the function x’ = x'(t) is obtained from the function x(t), in mathematics it is called 
the derivative of the function x = x(t) (in order to find out how to calculate the 
derivative of a function and much more, it is necessary to learn differential calculus. 
This is coming). 

Now we are able to write what is given to us: 


x'(t) =k- x(t), (A.1) 


where x(0) = xo, and we want to find the dependence itself. 
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We just wrote down the first differential equation (A.1). In general, equations 
containing derivatives are called differential equations (some clarifications and ex- 
ceptions are not yet relevant). It is worth noting that the independent variable is 
often omitted, in order to simplify the text while writing the equation. For instance, 
Eq. (A.1) is written in the form x’ =k - x. If the desired function is denoted by the 
letter f or u, then the same equation would have the form f’=k- f oru!/=k-u 
respectively. 


It is now clear that if we learn not only how to write, but also to solve or study 
differential equations, we will be able to predict and know many things. That is why 
Newton’s sacramental phrase referring to the new calculus, was something like this: 
“Tt is useful to learn how to solve differential equations.” 


Problem 1 Write down an equation for the example above of reproduction in a 
glass. Which coefficient k, initial condition x(0) = xo, and dependence x = x(t) do 
we have? 


Having succeeded with the first equation, let us try to encode by a differential 
equation several further natural phenomena. 


Example 2 Assume now, as is always the case, that there is not infinitely much 
food and that the environment cannot support more than M individuals, i.e., that 
the biomass does not exceed the value M. Then the growth rate of the biomass 
will presumably decrease, for instance proportionally to the remaining conditions 
of the environment. As a measure of the remaining conditions, you can consider the 
difference M — x(t), or even better consider the dimensionless value | — U8 In 
this situation, instead of Eq. (A.1), we obviously have the equation 


eaten (1-22), (A.2) 


which reduces to Eq. (A.1) at the stage when x(t) is much less than M. Conversely, 
when x(t) is very close to M, the growth rate becomes close to zero, i.e., the growth 
stops, which is natural. After having improved some skills, we shall find out later 
precisely how the rule x = x(t) looks in this case. 


Problem 2 A body having an initial temperature 7p cools down in an environment 
that has a constant temperature C. Let T = T(t) be the rule of variation of temper- 
ature of the body over time. Write down the equation that this function must satisfy, 
assuming that the cooling rate is proportional to the temperature difference between 
the body and the environment. 


The velocity u(t), which is the rate of change of the variable x(t), is called the 
derivative of the function x(t) and denoted by x’(r). 

The acceleration a(t), as you may know, is the rate of change of the velocity v(t). 
This means that a(t) = v'(t) = (x’)'(t), i.e., it is the derivative of the derivative of 
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the initial function. It is called the second derivative of the initial function and it is 
frequently denoted by x” (t) (some other notation will appear later). If we are able to 
find the first derivative, we may repeat the same process to calculate the derivative 
x(t) of any order n of the initial function x = x(t). 


Example 3 Let x = x(t) be the rule of motion of a point with mass m, i.e., coordi- 
nates for the position of the point as function of time. For simplicity, we may assume 
that the motion is along a straight line (horizontal or vertical); thus there is only one 
coordinate. 

The classical Newton’s law m-a = F, relating the force acting on a point of mass 
m with acceleration caused by this action, can be written as 


m-x"(th= F(t), (A.3) 


or in abbreviated form, m- x” = F. 

If the acting force F(t) is known, then the relation m - x” = F can be considered 
a differential equation (of second order) with respect to the function x(f). 

For example, if F is the force of gravity on Earth’s surface, then F = mg, where 
g is the acceleration at free fall. In this case, our equation has the form x’(t) = g. 
As you might know, Galileo discovered that in free fall, x(t) = 5 gt? + vot + xo, 
where Xo is the initial position and vg is the initial speed of the point. 

In order to check that this function satisfies the equation, it is necessary to be able 
to differentiate a function, i.e., to find its derivative. In our case, we even need the 
second derivative. 

Just below we shall give a small table of some functions and their derivatives. Its 
deduction shall be made later, in the systematic presentation of differential calculus. 
Now try yourself to do the following. 


Problem 3 Write down the equation of free fall in the atmosphere. In this case, 
there will arise a resistance force. Consider it proportional to the first (or second) 
rate of change of the movement (the speed in free fall does not grow to infinity due 
to the presence of the force of resistance). 


You should be convinced by now that it is worthwhile learning how to calculate 
derivatives. 


A.4 Velocity, Derivative, Differentiation 


First, let us consider a familiar situation in which we can trust our intuition (and we 
change the notation from x(f) to s(t)). 

Suppose that a point moves on the whole numerical real line, s(t) is its coordinate 
at the moment ¢, and v(t) = s’(ft) is its velocity at the same moment t. After the 
time interval h, from its former position at the moment f, the point is located at the 
position s(t +h). In our picture of velocity the quantity s(t +h) — s(t), the distance 
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traveled in the time interval h after the moment f¢, and its velocity at the moment t 
are related by the equation 


s(t+h)—s(t) ¥v(t)-h; (A.4) 
in other words, v(t) ¥ a, and this approximation becomes closer to an 
equality as the interval h decreases its length. 

Thus, we have to assume that 


v(t) := lim cl dei a 
h>0 h 
i.e., we define v(t) as the limit of the quotient between the increment of the function 
and the increment of its argument as the latter approaches zero. 
Now after this example, nothing impedes us from providing the usual definition 
for the value f’(x) for the derivative f’ of the function f at the point x: 


| (A5) 


a See | 
FG) = en 
i.e., f’(x) is the limit of the quotient of increments Af/Ax as Ax approaches zero, 
where Af = f(x +h) — f(x) is the increment of the function with respect to the 
increment of its argument Ax = (x +h) — x. 
Equation (A.5) can be rewritten in a similar form to (A.4) in the same convenient 
and useful form 


Seth) — fO= f' Oh + o(h), (A.6) 


where o(h) is some error (of approximation), small in relation to , as h approaches 
zero. (This means that the quotient o()/h approaches zero as h goes to zero.) 
Now we shall make some concrete calculations. 


1. Let f be a constant function, i.e., f(x) =c. Then it is clear that Af = f(x + 
h) — f(x) =O and f’(x) =0. This is natural, since the velocity of change is equal 
to zero if there is no change. 

2. If f(x) =x, then f(x +h) — f(x) =h, and therefore f’(x) = 1. If f(x) =kx, 
then f(x +h) — f(x) =khand f’ =k. 

3. By the way, we can make two very common but extremely useful remarks: if 
the function f has its derivative f’, then the function cf, where c is some arbitrary 
constant, has as derivative cf’, i.e., (cf)’ = cf’; in the same way, (f +g)’ = f’ +2", 
i.e, the derivative of a sum is equal to the sum of the derivatives, if these derivatives 
are defined. 

4, Let f(x) =x?. Then f(x +h) — f (x) = (ath)? —x? = 2xh+h? = 2xh+o(h); 
hence f’(x) = 2x. 

5. Analogously, if f(x) = x3, then 


f(x +h) — fx) = (x +h) — x3 = 3x7h + 3xh? +h? = 3x7h + O(h); 


therefore, f’(x) = 3x. 
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Table A.1 

f(x) f') f"@) He f™() 

a* a*|na a“ Ina ee a“ \n"a 

e* e* e* e* 

sinx cosx —sinx tee sin(x + n7/2) 
cos x —sinx —cosx tee cos(x + nz /2) 
(1+ x)* a(l+x)*7! a(a — 1)(1+x)%~2 tee ? 

xe axe! a(a — 1)x%-2 tee ? 


6. Now it is clear that in general, if f(x) =x”, then one has 


Fath) — f@)=@thy" —x" =nx""h + o(h), 


and therefore f’(x) =nx"~!, 


7. This implies that if we have a polynomial 
P(x) =aox" +ayx" | + +++ +ay—1x +an, 
then 
P! (x) = nagx”—! + (a — layx"? +++ + ay_1. 


We calculated the above derivatives following the definitions. To develop and master 
the techniques of differentiation you have to practice. Now as examples and for your 
illustration we introduce Table A.1 with functions and their derivatives. Later we 
shall prove the results. 

In Table A.1 e is anumber (e = 2.7...), that appears everywhere in analysis, just 
like the number z in geometry. The logarithm with base e is frequently denoted by 
In, instead of log,, and you can find it in the first row of the table in the second and 
third columns. The logarithm with this base is called the natural logarithm and it 
appears in many formulas. 


Problem 4 Assuming that the formula of the first derivative f’ is correct, verify the 
expression for f) and complete the table at the places where the question marks 
are. After that, calculate the value f (0) in every case. 


Problem 5 Try to find the derivative of the function f(x) =e and the solution of 
Eq. (A.1). Explain at what point in time the initial condition xo (capital, biomass, or 
something else represented by this equation) will double. 


A.5 Higher Derivatives, What for? 


A wonderful and very useful expansion of the central equation (A.6), which can be 
written as 


fx +h) = fa) + f’@)h + o(h), (A.7) 
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turns into the following formula (Taylor’s formula): 


f@+h=fay+— ai Wh fon per fo +0(h"). (A.8) 
If we set x = 0 and then replace the letter h by the letter x, we obtain 
f(x)=fO+ Os # 5 fOr ene < f(O)x" +0(x"). (A) 
For instance, if f(x) = (1 + x)®, following Newton we find that 


Gee aia 8 eg i eo: 
1! 2! n! 
(A.10) 
Sometimes in the formula (A.9), it is possible to continue the sum to infinity, 
removing the reminder term, since it tends to zero as n — oo. 


In particular, we have, 


1 1 ti 
a ae +5 se a ash (A.11) 
_ 1 2 k 2k 
cose =1— Sa? +e +(-1) a ibsaey (A.12) 
Sipe ea ee yt _ yet (A.13) 
Wo 3! (2k + 1)! 


We have obtained a representation of relatively complex functions as a sum (an 
infinite sum, a series) of simple functions, which can be calculated with the usual 
arithmetic operations. The finite pieces of these sums are polynomials. They provide 
good approximations of the functions decomposed in such a series. 


A.5.1 Again Toward Numbers 


We have always tacitly assumed that we are dealing with functions defined on the 
set of real numbers. But the right-hand side of Eqs. (A.11), (A.12), and (A.13) make 
sense when x is replaced by a complex number z = x + iy. Then, we are able to say 
what the expressions e%, cos z, sin z mean. 


Problem 6 Discover, after Euler, the following impressive formula: e'? = cos g + 

ising, linking these elementary functions and the remarkably beautiful equality 
” + 1 =0 resulting from this formula, which combines the basic constants of the 

mathematical sciences (arithmetic, algebra, analysis, geometry, and even logic). 
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A.5.2 And What to Do Next? 


As we say in Russian, “with the fingers”, without details and justifications, you have 
been given some idea of differential calculus, which is the core of a first-semester 
course in mathematical analysis. Step by step, we have become acquainted with 
the concepts of numbers, functions, limits, derivatives, series, which had been only 
superficially studied so far. 

Now you know why it is necessary to take time to dive into a detailed and careful 
consideration of all of these concepts and objects. The understanding of them is nec- 
essary for a professional mathematician. For the average user, this is not required. 
Most people drive a car without even opening the hood. But that is reasonable, be- 
cause someone well versed in engines designed a machine that works reliably. 


Appendix B 
Numerical Methods for Solving Equations 
(An Introduction) 


B.1 Roots of Equations and Fixed Points of Mappings 


Notice that the equation f(x) = 0 is obviously equivalent to the equation a(x) f (x) 
= 0 if a(x) £0. The last equation, in turn, is equivalent to the relation x = x — 
a(x) f(x), where x can be interpreted as a fixed point of the mapping g(x) := x — 
a(x) f(x). 

Thus, finding roots of equations is equivalent to finding the fixed points of the 
corresponding mappings. 


B.2 Contraction Mappings and Iterative Process 


A mapping g : X — X from the set X C R to itself is called a contraction mapping 
if there exists a number q, 0 < g < 1, such that for every pair of points x’, x”, their 
images satisfy the inequality |y(x’) — g(x”)| < q|x’ — x” |. 

It is clear that this definition applies to arbitrary sets, without any changes, 
on which the distance d(x’, x”) between any two points is defined; in our case, 
d(x’, x”) = |x’ = x" |. 

It is also obvious that a contraction mapping is continuous and cannot have more 
than one fixed point. 

Let g : [a, b] > [a, b] be a contraction mapping from the interval [a, b] into it- 
self. We shall show that the iteration process x,+1 = ¢(Xn), starting at any point xo 
from this interval, leads to the point x = limy-+o0 xX», the fixed point of the map- 
ping @. 

We notice first that 


[Xn-+1 —Xnl < q\Xn —X-1l<--9°S q”"|x1 — xol. 
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Therefore, for all natural numbers m, n, with m > n, inserting intermediate points 
and using the triangle inequality, we get the estimate 


Xm — Xnl S |Xm — Xm—1| +++ + Xn — Xn 


q 


<(q™ ' +--+ +q")|x1 — x0! < = 


|x1 — xol, 


and from this it follows that the sequence {x,} is fundamental (i.e., is a Cauchy 
sequence). 

Hence, by the Cauchy criterion it converges to a point x in the interval [a, b]. 
This point is a fixed point of the mapping ¢: [a, b] > [a, b], since by taking the 
limit n — oo in the relation x,4+1 = g(%»), we obtain the equality x = g(x). 

(Here we used the obvious fact that a contraction mapping is continuous; actually, 
it is even uniformly continuous.) 

By passing to the limit m — oo in the relation |x, — x,| < is |x — xo|, one 
gets the estimate 


n 


q 
Ix — Xnl < ar xo| 


for the deviation value between the approaching point x, and the fixed point x of 
the mapping ¢. 


B.3 The Method of Tangents (Newton’s Method) 


We presented a theorem stating that a continuous real-valued function from an inter- 
val taking values with different signs at the extremes of the interval has at least one 
zero in this interval (a point x where f(x) = 0). In its proof, we showed the sim- 
plest, but universal, algorithm for finding this point (splitting the interval in half). 
The order of the speed of convergence in this case is 2~”. 

In the case of a convex differentiable function, the method proposed by Newton 
can be much more efficient in terms of speed of convergence. 

We draw a tangent to the graph of the given function f at some point (xo, f(x0)), 
where xo € [a, b]. Next, we find the point x; where the tangent intersects the x-axis. 
By repeating this process, we obtain a sequence {x,} of points that quickly con- 
verges to the point x such that f(x) = 0. (It is possible to prove that each successive 
iteration leads to the doubling of the number of correct digits of x found at the 
previous step.) 

It is easy to prove analytically (check it!) that the method of tangents reduces to 
an iteration process 


fF On) 
i Ga) 


Xn+1 =Xn 
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For instance, the solution of the equation x” — a = 0, i.e., the computation of 
»/a, according to the formula above, reduces to an iteration process 


1 a 
cere (m — 1)Xn xl . 


In particular, for calculating /2 with the method of tangents, we obtain 


1 a 
Ant+1 = 5 mes s 


As can be seen from the formulas above, Newton’s method looks for fixed points of 


the mapping g(x) = x — feo It is a special case of the mapping g(x) = x —af (x), 
discussed in the first section, and we obtain it by setting a(x) = Fen 

Note that in general, the mapping g(x) = x — af (x) and even the mapping 
g(x) =x - ae involved in the method of tangents do not have to be contrac- 
tion mappings. Moreover, as can be shown by simple examples, in the case of a 
general function f, the method of tangents does not always lead to a convergent 
iteration process. 

If in the expression g(x) = x — af (x), the function a(x) can be chosen in the 
given interval so that |y’(x)| <q < 1, then the mapping g : [a,b] > [a, d], of 
course, will be a contraction. 

In particular, if ~@ can be taken as the constant 


1 


Fim)? then we obtain g(x) = x — 


; ae and g’ (x) =1— Lo . If the derivative of the function f is at least continuous 
at xo, then in some neighborhood of xo, we have |g’(x)| = |1 — en <q< If 


the function y maps this neighborhood into itself (although this is not always the 
case), then the standard iterative process induced by the contraction mapping ¢ of 
this neighborhood will lead to a unique point of the mapping ¢ in this neighborhood 
at which the original function f vanishes. 


Appendix C 
The Legendre Transform 
(First Discussion) 


C.1 Initial Definition of the Legendre Transform 
and the General Young Inequality 


We call the Legendre transform of a function f with variable x a new function f* 
with new variable x*, defined by the relation 


f*(x*) = sup(x*x — f(x), (C.1) 


where the supremum is taken with respect to the variable x, for a fixed value of the 
variable x*. 


Problem 1 


a) Check that the function f* is convex in its domain of definition. 

b) Draw a graph of the function f, the line y = kx where k = x*, and specify 
the geometric meaning of the value f*(x*). 

c) Find f*(x*) for f(x) =|x| and f(x) =x’. 

d) Prove that from the definition (C.1), it follows clearly that the inequality 


ae dl Oa Ce C9) (C.2) 


is satisfied for all values of the arguments x, x* in the domain of definition of the 
functions f and f*, respectively. 


Relation (C.2) is usually called the general Young’s inequality or the Fenchel- 
Young inequality, and the function f*, in convex analysis, for instance, is usually 
called the Young dual of the function /f. 
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C.2 Specification of the Definition in the Case of Convex 
Functions 


If the supremum involved in the definition (C.1) is attained at some inner point x 
from the domain of definition of the function f, and this function is smooth (or at 
least differentiable), then we find that 


x* = f'(x) (C.3) 


and therefore 


f*(x*) =x*x — f(x) =xf'(x) — f(x). (C4) 


Thus in this case, the Legendre transform is specified in the form (C.3), giving 
the argument x* and (C.4) providing the value f*(x*) of the function f*, i.e., the 
Legendre transform of the function f. (Notice that the operator x f’(x) — f(x) was 
studied already by Euler.) 

Moreover, if the function f is convex, then in the first place, the condition (C.3) 
will not only provide a local extremum, but also will be a local maximum (check 
it!), which in this case will be the global or absolute maximum. 

In the second place, due to the monotonic increase of the derivative of a strictly 
convex function, Eq. (C.3) is uniquely solvable with respect to x, for such functions. 

If Eq. (C.3) admits an explicit solution x = x(x*), then on substituting it in (C.4), 
we obtain an explicit expression for f*(x*). 


Problem 2 


a) Find the Legendre transform of the function tx”, for a > 1, and obtain the 
classical Young’s inequality 
1 1 
ab < —a% + —DP, (C.5) 
a B 
where 1 Df, 

b) What is the domain of definition of the Legendre transform of a smooth 
strictly convex function f having the lines ax and bx as asymptotes, for x — ++-oo 
and x — —oo respectively? 

c) Find the Legendre transform of the function e* and prove the inequality 


t 
xt<e*4+t¢lIn-. (C.6) 
e 


C.3 Involutivity of the Legendre Transform of a Function 


As we already noted, Eq. (C.2), or equivalently the inequality 


f(x) =xx* — f*(x*), (C.7) 
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holds for all values of the arguments x, x* in the domains of definition of the func- 
tions f and f*, respectively. 

At the same time, as shown by formulas (C.3) and (C.4), if x and x* are linked 
by the relation (C.3), then the last inequality, (C.7), becomes an equality, at least in 
the case of a smooth strictly convex function f. Recalling the definition (C.1) of the 
Legendre transformation, we conclude that in this case, 


(fy =F (C.8) 


So the Legendre transform of a smooth strictly convex function is involutive, i.e., 
the twofold application of this transform leads to the original function. 


Problem 3 


a) Is it true that f** = f for every smooth function f? 

b) Is it true that f*** = f* for every smooth function f? 

c) Differentiating Eq. (C.4), using (C.3) and provided that f” (x) 4 0, show that 
x = f*(x*) and therefore f(x) = xx* — f*(x*) (involutivity). 

d) Check that at the corresponding points x, x*, linked by Eq. (C.3), we have 
Pea) &) and f= — G9 R/S. 

e) The family of lines px + p* depending on the parameter p is a family of 
tangents to a certain curve (the envelope of this family). Find the equation of this 
curve. 


C.4 Concluding Remarks and Comments 


As part of the discussion about convex functions, we gave the initial presentation of 
the Legendre transform at the level of functions of one variable. However, here we 
shall facilitate the perception of the Legendre transform and work with it in a number 
of important more general cases of applications, such as in theoretical mechanics, 
thermodynamics, equations of mathematical physics, calculus of variations, convex 
analysis, contact geometry, ..., and many more yet to be dealt with. 

We shall analyze various details and possible developments of the concept of the 
Legendre transform. Here we add only the following remark. The argument of the 
Legendre transform is a derivative, or equivalently the differential of the original 
function, as is shown by Eq. (C.3). 

If the argument x is, for instance, a vector of a linear space X equipped with a 
scalar product (-, -), then the generalization of the definition (C.1) is the equation 


f* (x*) = sup((x", x) - f(x). (C.9) 


If we take x* as a linear function on the space X, i.e., assume that x* is an 
element of the dual space X* and the action x*(x) of x* ona vector x is denoted as 
before by (x*, x), then Eq. (C.9) will continue to be meaningful, since the function 
f is defined on the space X, and therefore the Legendre transform f* is defined in 
the space X*, the dual of the space X. 


Appendix D 
The Euler—MacLaurin Formula 


D.1 Bernoulli Numbers 


Jacob Bernoulli found that -">' nk = oa son CE Ban ™, where C/? = 
aim are the binomial coefficients, and Bp, B,, Bo, ... are rational numbers, now 
called Bernoulli numbers. These numbers are met in different problems. They have 
the generating function =4 = pee ae where they occur as coefficients in the 
Taylor expansion, and in this way they can be calculated. 

These numbers can also be calculated with the following recurrence formula: 


1 k+1 
Bo=1, By =—— > Ct Bn-t- 


Problem 1 Find the first Bernoulli numbers and check that all Bernoulli numbers 
with odd indices, except B;, are equal to zero, and that the signs alternate for the 
Bernoulli numbers with even indices. (The function x/(e* — 1) + x/2 is even.) 


Euler discovered the connection B, = —né(1 — n) between the Bernoulli num- 
bers and the Riemann ¢-function. 


D.2 Bernoulli Polynomials 


Bernoulli polynomials can be defined by various means. For example, Bernoulli 
polynomials are defined recursively by Bo(x) = 1, B/, (x) =nBn_-1(x), with the con- 
dition that i. B,(x) dx = 0; they are also defined through the generating function 
ke poee a through the formula B, (x) = )-7_9 C B,—«x*, or through 


n=0 7 


Ba(x) = "7-9 a Veo(-DFCk a +k)". 
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Problem 2 


a) Based on the different definitions, compute the first few Bernoulli polynomi- 
als, and check the coincidence between them and the fact that Bernoulli numbers 
are the values of the Bernoulli polynomials at x = 0. 

b) By differentiating the generating function, show that the Bernoulli polynomi- 
als B,, (x) defined through this function satisfy the recurrence relation above, which 
in turn means that 


x 
B, (x) = B, +n f B,_-\(t) dt. 
0 


D.3 Some Known Operators and Series of Operators 


Problem 3 


a) If A is an operator, then we employ the notation i as is usual for numbers, 
for referring to the operator A~!, the inverse operator of A. 

The integration operator { is the inverse of the differentiation operator D (with 
a proper setting of the integration constant). Similarly, the sum operator }* is the 
inverse of the difference operator A, whose action is defined as Af(x) = f(x + 
1) — f(x). Specify exactly how to find )> f(x). 

b) Do you agree with the fact that B, (x) = D(eP — 1)~!x"? 

c) According to Taylor’s formula, 


/ ” 2 
fo), £'@) (2 D ) Feo 


Af(x)=fx+)—f)= + +-..=(24+2 +... 


1! 2! 1! 2! 


therefore, A =e? — 1 and y= A7! = (e? — 1)~!, and since a i bear Be zk 
then 


Bo. Bi Bo B34 ey Binoy 
eG art Dip DRS +)°— Dk. 


2! 3 


D.4 Euler—MacLaurin Series and Formula 


By applying this operator relation to the function f(x), we guess the Euler— 
MacLaurin summation formula, more precisely, not the formula itself but the corre- 
sponding series 


b 0° B 1 
Le f= it frase YF Mn| 
4 k=l 


a<n<b 


where a, b, n are integers and k is a natural number. 
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These series differ in the way that the Taylor series is different from Taylor’s 
formula, which is finite and contains certain information (remainder term) providing 
the possibility of estimating the remainder (the value of the approximation error). 

When the sum is reduced to a single term, the simplest and at the same time basic 
Euler—MacLaurin formula with a remainder term has the form 


1 m 1 
fO)= Forde + Yo AE pV) se aa = sy ale )dx. 
2 ea o 


It is assumed here that the original function f is smooth enough, for example, 
that it possesses continuous derivatives of the required order. 


Problem 4 Using the formula of integration by parts, prove by induction the Euler— 
MacLaurin formula written above. (Recall that Taylor’s formula with remainder 
term of integral type can also be obtained with a simple integration by parts.) 


D.5 The General Euler-—MacLaurin Formula 


The general Euler-MacLaurin formula, providing the value of the sum ) 0 <n <p f(n), 
has the form 


Laem 


by roy= fonds ian doo) 


a<n<b 


where a, b, n are integers and k, m are natural numbers, 


m= cin f° Bal Pte) pt (x) dx, 


and {x} is the fractional part of the number x. 


Problem 5 Prove this formula, given that every interval [a, b] whose endpoints are 
integers can be divided into unit intervals (with length 1) and each unit interval can 
be translated to the interval [0, 1] with a shift. 


D.6 Applications 


Problem 6 


a) Using the Euler-MacLaurin formula and setting f(x) = x”, show that 
Doce ee pe o CK By (b™* — a”—*), and in particular, obtain Jacob 
Bernoulli’s feiion Vie” (Ss Ce Bee” *. 
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b) To calculate the asymptotic behavior of a sum or a series, usually the follow- 
ing kind of Euler-MacLaurin formula is used: 


b oo 


b B 
Yrm~f Piva arn eye ai Oat @) 


n=a @ k=1 


where a, b are integers. The formula often remains valid for the extension from the 
interval [a, b] to the whole line. In many cases, the integral on the right side can be 
calculated in terms of elementary functions, even if the sum on the left side cannot 
be expressed. Then all the terms of the asymptotic series can be expressed in terms 
of elementary functions. For example, 


+00 


‘: 1 f _ " 1 a: Box 
eee S ei 
parece sy? Jo (z+)? 222 = g2k+1 


Moreover, in this case, the integral can be calculated and is equal to i, 
c) Setting f(x) =x7!, prove the asymptotic formula 


n 


1 ae 8 B 

2k 2m+2 
ye gainntyt+ > a 
Lk ee 4 Dk" Om + Dyn 


where 0 < 6), < 1, and y is a constant (Euler’s constant). 
d) If we take f(x) = In x, show that 


n m 


Box 


1 
Ze nen eee gin +) OE Dak Runs 


where 


Bom+2 


n= Om. Ont Om + DAT 


0 < m,n < 1, and o is a constant (in fact, equal to In ¥ 277). 
By raising to powers, we can obtain the asymptotic Stirling’s formula for the 
values n! asin > oo. 


D.7 Again to the Actual Euler-MacLaurin Formula 


Problem 7 


a) If a and 7n are integers such that a <n and f is a slowly changing function at 
the interval [a, b], then the sum S = sf(a)t+ flat I+ f(a@+2)4+---+fla- 
1)+ 5 f (n) is a good approximation of the integral J = ? f(x) dx. 
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Remember this by drawing a picture showing the geometric meaning of the quan- 
tities S and J, and at the same time recalling the numerical methods for calculating 
the integral. 

b) If j is an integer, then integration by parts gives 


j+ 1 i a 1 
/ fede (: age 5) roo -{ (: =e 5) Fever, 
J iy J 


1 1 j+1 jt+l 
sf) + FU) -| F(x) dx +f ied Gide 
J 


J 


or 


where «1 (x) =x — [x] — 3 = {x} — 5. 
(Recall that [x] and {x} are the integer and fractional parts of x, respectively.) 
Summing these equalities for j from j =a to j =n — 1, we obtain 


S=I+ is (x) f’ (x) dx. 


Draw the graph of the function @. 
c) Now integrate the integral [ H ie, w(x) f’(x) dx by parts and obtain an expres- 


sion with a new integral residue te w(x) f(x) dx, where w2(x) = f (x) dx. 
Show the continuity of the function 2, given that [ 7 Hoi (x) dx = iF (x - 


5) dx = 0. Take the sum from j =a to j =n — 1, as before, and obtain in one 
step a more advanced expression for the value of S with a new integral residue. 
d) Continuing with this process, obtain the following Euler—-MacLaurin formula: 


m-1 


S=1+ VCD eH OFOC) 


s=l1 


4p! i “tm PO Wade. 


a a 


e) Check that w(x) = a BK ({x}) and compare the Euler-MacLaurin formula 
obtained with the one discussed previously. 


Appendix E 
Riemann-Stieltjes Integral, Delta Function, 
and the Concept of Generalized Functions 


Basic Background 


E.1 The Riemann-Stieltjes Integral 


Specific Objective and Some Heuristic Arguments We have considered a num- 
ber of examples of the effective use of the integral in the calculation of areas, vol- 
umes of solids of revolution, lengths of paths, work forces, energy .... We found the 
potential of the gravitational field and computed the escape velocity for Earth. Using 
the machinery of the integral calculus, we convinced ourselves of the fact that, for 
example, the path length does not depend on its parametrization. At the same time, 
we pointed out that certain calculations (e.g., the length of an ellipse) are associated 
with nonelementary functions (in this case elliptic functions). 

All the quantities mentioned above (length, area, volume, work, ...) are addi- 
tive like the Riemann integral. We know that every additive function /[a@, 6] of 
the oriented interval [a, 6] C [a, b] has the form /[a, 6] = F(8) — F(a) if we set 
F(x) =I1[a,x]+ C. In particular, we can take an arbitrary function F and define 
an additive function J[a, 6] = F(8) — F(q@) from it, considering F(x) = I[a, x]. 
If the function F is discontinuous on the interval [a,b], then the function /[a, x] 
is also discontinuous there. But then it cannot be represented as a Riemann inte- 
gral i p(t) dt of any Riemann integrable function (a density p), because such an 
integral, as we know, is continuous at x. 

Suppose, for instance, that the interval [—1, 1] is a string having a bead of mass | 
in the middle. If /[a, 6] is a mass within the interval [a, 6] C [—1, 1], then the 
function 7[—1, x] is equal to zero for | < x < 0 and equal to | for 0 <x <1. 
If we tried to describe the distribution of the mass in the segment in terms of the 
density distribution (i.e., in terms of the limit of the ratio between the mass lying 
in a neighborhood of the point and the size of the neighborhood when the latter is 
contracted to a point), then we would have to define the density p as p(x) = 0 for 
x #0 and p(x) = +00 for x = 0. Physicists and all scientists after Dirac called p a 
“function” (this distribution density). Today, we call p a “generalized function” or 
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“distribution”. We denote it by 6 and it is defined as f. va d(x)dx=lifa<0< 6 


and f? 30x) dx = 0 if a < B <0 or0 <a < #, no matter what the numbers a and 
B are. 

Of course, according to the traditional definition of the integral, e.g., the Riemann 
integral, this integral does not make sense (for the simple reason that the integrand 
is an unbounded “function”). We make liberal use of the integral symbol here. It is 
used only as a replacement of the additive function [a, 8], discussed above, when 
we considered the bead on a string. 


Example 4 (Center of mass) Recall the fundamental equation m# = F describing 
the motion of a point mass m due to the effect of a force F, where r is the radius 
vector of the point. If there is a system of n material points, then for each of them 
we have the equation m;r; = F;. By summing all these equalities, we obtain the 
equation )7;_, mir; = )7;_, Fi, which can be rewritten in the form M )°j_, 47; = 
>. Fi, or in the form Mry = F, where M = )°"_, mj, F = 7y_, Fi, and iM = 
i “ari. That means that if the total mass of the system is placed at some point 
in the space, with the radius vector being equal to ry = )~_, a rj, and under the 
influence of the force F = )~?_, F;, then the mass will move according to Newton’s 
law, no matter how complex the mutual motion of the individual parts of the system 
is. 

The point of the space we found with radius vector )~?_ iw 
of mass of the system of material points. 


Mir: is called the center 


Now suppose we have the task of finding the center of mass of a material body, 
i.e., a region D of space, in which a mass is somehow distributed. Let du be the 
volume element, the concentrated mass dm, and M the total mass of the body D. 
Then we suppose that M = /, p dm, and then the center of mass can be found with 
the formula a af pr dm, where r is the radius vector of the mass element. 

We still do not know how to integrate over domains in space, and therefore, we 
consider the one-dimensional case, which is also quite informative. Thus, instead of 
the region D, we consider the interval [a, b] of the coordinate axis R. 

Then M = reg dm, and the center of mass is found with the formula i li x dm, 
where x is the coordinate of the mass element dm, which therefore can be written 
more precisely as dm(x). 

The meaning of what we wrote evidently must be as follows. Take a partition 
P of the interval [a,b], with some marked points &; € [x;~1, x;]. To the interval 
[xi-1, x;] corresponds the mass Am;. We consider the sums ; Am;, >; &| Am;, 
and passing to the limit as the parameter A(P) of the partition tends to zero, we 
obtain respectively what is denoted by : dm and [ . xdm. 

We arrive at the following generalization of the Riemann integral. 


Definition of the Riemann-Stieltjes Integral! Let f, g be real-, complex-, or 
vector-valued functions defined over the interval [a,b] C R. Let (P,&) = (a= 


ITD. Stieltjes (1856-1894) — Dutch mathematician. 
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xo < & <x) <-+- < & < xX, =D) be a partition of this interval with marked 
points and parameter A(P). We consider the sum ae f(&) Agi, where Ag; = 
g(xi) — g@i-1). 

The Riemann-Stieltjes integral of the function f with respect to the function g 
over the interval [a, b] is defined as the following value: 


b n 
/ F(x) dg@x) =, lim ody FE) Ag: (E.1) 


if the above limit exists. 
In particular, when g(x) = x, we return to the standard Riemann integral. 


E.2 Case in Which the Riemann-Stieltjes Integral Reduces 
to the Riemann Integral 


Note also that if the function g is smooth and f is a Riemann-integrable function 
over the interval [a, b], then 


b b 
[ seem = | f(x)g'(x) dx, (E.2) 


i.e., in this case, the computation of the Riemann-Stieltjes integral reduces to the 
computation of the Riemann integral of the function fg’ on the same interval. 

Indeed, using the smoothness of the function g and the mean value theorem, we 
can rewrite the sum at the right side of Eq. (E.1) in the following form: 


>> Gi Agi = >> FE)(g@i) — i-1)) = > FE’ EG) Gi — 41-1) = 


i=1 i=1 i=l 


=> fds’ GdAx + >> f GD(8' ED - 8 (&)) Ax. 


i=l i=l 


By the uniform continuity of the function g’ on the interval [a, b] and the bound- 
edness of the function f, the last sum approaches zero as A(P) — 0. The first sum 
is the usual integral sum for the integral, which appears at the right side of (E.2). 
By our assumptions about the functions f and g, the function fg’ is Riemann in- 
tegrable over the interval [a,b]. Thus, the sum above approaches the value of this 
integral for 4(P) — 0, which completes the proof of Eq. (E.2). 


Problem 1 We achieved the proof of this equality using the mean value theorem, 
which holds for real-valued functions. Using the general finite increment theorem, 
complete the proof for vector-valued functions (for instance, complex-valued func- 
tions). 
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E.3 Heaviside Function and an Example of a Riemann-Stieltjes 
Integral Computation 


The Heaviside step function is defined through H(x) = 0 for x <0 and H(x) = 1 
for 0 < x. Let us calculate the integral 7 : f(x) dH(x). Following the defini- 
tion (E.1), we write the sum 7", f(&)) AH; = 7'_, f (&)(H (ai) — HA (xi-1). Be- 
cause of the definition of the Heaviside function, this sum is clearly equal to zero if 
the point 0 is not contained in the interval [a, b], and equal to f(x;) if the point 0 
falls into some of the intervals [x;_1,x;] (more precisely, in the interior of it or at 
its endpoint x; ). In the first case, the integral is of course zero. 

In the second case, under the limit A(P) — 0, the point &; € [x;~1, x;] ap- 
proaches 0. Therefore, if the function f is continuous at 0, then the limit of the 
sum above will be f (0). 

If the function f is discontinuous at 0, with small changes of the value of &; it is 
possible to change essentially the value of f(&;), and thus the sums of integrals will 
not have a limit for A(P). 

It is clear that the last calculation has a general nature, since the occurrence of 
joint points of discontinuity for the functions f, g involved in the Riemann-Stieltjes 
integral leads to the nonexistence of the limit if such a joint point occurs in the 
interior of the integration interval. 

Therefore, the calculation above shows that if g is, for instance, a function of 
class Co(R, R), i.e., a function defined on the whole real line and continuous, that 
is identically zero outside of some bounded set, then 


[ow dH (x) = g(0). (E.3) 


E.4 Generalized Functions 


E.4.1 Dirac’s Delta Function. A Heuristic Description 


As we previously remarked, physicists among other scientists use the delta function 
6 after its introduction by Dirac. This “function” is zero everywhere except at the 
origin, where its value is infinity. Along with this (and this is really important), 


B 
( d(x)dx=1 ifa<0<A, 
a 


and 
B 
i d(x)dx=0 ifa<6 <Oorif0<a< 6, 
a 


for all real values a and f. 
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It is natural to assume that the multiplication of the integrating function by a 
number leads to the product of this number with the integral. But then, if a function 
¢ is continuous at the origin, given that it is almost constant in some small neighbor- 
hood U (0) of the origin and that {, U(0) 6(x) dx = 1, we conclude that the following 
relation should hold: 


[ese dx = (0). (E.4) 


By comparing Eqs. (E.2), (E.3), and (E.4) and continuing this chain of findings, 
we conclude that 


H'(x) =8(x). (E.5) 


Of course, it does not fit into our classical setting. However, these considerations 
are very constructive, and if one were obliged to write a value H’(x), then one would 
write what we now write, namely 0 if x 4 0 and +00 if x =0. 


E.5 The Correspondence Between Functions and Functionals 


One possible way out of this difficulty consists in the following idea of extension 
(generalization) of the concept of “function”. 

We shall look at the function through its interaction with other functions. (As 
usual, we are not interested in the internal structure of a device, such as a human, 
and we consider that we know an object if we know how the object responds to 
certain input actions or incoming questions.) 

Take an integrable function f on the interval [a, b], and consider the functional 
Ay (a function over functions) generated by f/f: 


b 
A f(y) -|/ f(x)p(x) dx. (E.6) 


In order to simplify the technical difficulties, we shall consider smooth test functions 
even of class Cy [a, b], i.e., infinitely differentiable functions vanishing in a neigh- 
borhood of the endpoints. It is even possible to continue both functions f and g as 
zero outside the interval [a, b], and instead of writing the integral over the interval, 
we can write the integral as 


Af(y) =| reocoar. (E.7) 


Knowing the value of the functional Ay on test functions, we can find easily, 
if necessary, the value f(x) of the function f at any point where this function is 
continuous. 


Problem 2 
1 


a) Prove that the value 5- fie f(t) dt (integral average) tends to f(x) for 


€ — O at every point of continuity of the integrable function /. 
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b) Show that the step function 5,, equal to zero outside the interval [e, ¢] and 
equal to or inside this interval (the function 5, imitates Dirac’s 5 function), can 
be approximated with smooth functions 6, with these properties: d.(x) > 0 in R, 
5¢(x) =0 for |x| > ¢, and fp d.(x) dx = 1. 

c) Show that if ¢ > 0, then /, ois f (t)de(x — t)dt > f(x) at every point of 
continuity of the integrating function f. 


E.6 Functionals as Generalized Functions 


Thus, an integrable function f provides a linear functional A; (a linear function 
in the vector space of functions Cola, b] or cy (R)) defined through the formula 
(E.6) or (E.7), and moreover, with the use of the functional A ;, the integrable func- 
tion f itself can be restored at all its points of continuity (i.e., almost everywhere). 
Therefore, the functional A can be thought of as a different encoding or interpre- 
tation of the function f considered in the mirror of functionals. 

But in this mirror it is possible to find some other linear functionals that are not 
given through the integration of any function. As an example, we have the functional 
that we have studied before, ik v(x) dH (x) = g(0), which we denote by As; (given 
that we would like to write 6(x) dx instead of dH (x)). 

The functionals of the first type are called regular, while those of the second type 
are called singular. 

We shall consider functionals as generalized functions. This set of functionals 
contains our usual functions as a subset, consisting of all regular functionals. 

Thus, by relating the Riemann integral and its generalization the Riemann— 
Stieltjes integral, we gave an overview of the construction of generalized functions. 
We shall not delve into the details of the theory of generalized functions, which are 
related, for instance, to different spaces of test functions and the construction of lin- 
ear functionals (generalized functions) on them. We prefer to prove the rule for dif- 
ferentiating generalized functions. As a final remark showing the usefulness of the 
Stieltjes integral, we would like to add here that on the space C[a, b] of continuous 
functions ¢ on the interval [a, b], every linear continuous functional (either regular 
or singular) can be represented as a Riemann-Stieltjes integral i g(x) dg(x) for 
some properly selected function g. (For instance, the singular functional As repre- 
senting the generalized function 6 has the form te y(x) dH (x), shown in Eq. (E.3).) 


We began with an example in which we found the Stieltjes integral p? x dm(x) 


in the determination of the center of mass. The integral M, = { is x" dm(x) is called 
the moment of order n relative to a measure (e.g., a probability measure), mass, or 
charge distributed over the interval [a, b]. The moments Mo, M1, M2 are frequently 
met: Mo is the total mass (or measure of charge); M\/Mpo provides the center of 
mass in mechanics, and M, is the mathematical expectation (or expected value) of a 
random variable in probability theory; M2 is the moment of inertia in mechanics and 
the scattering of a random variable with expected value M, = 0 in probability the- 
ory. One of the problems of the theory of moments is the restoration of a distribution 
through the computation of the moments. 
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E.7 Differentiation of Generalized Functions 


Let A be a generalized function. What generalized function A’ should be considered 
the derivative of A? 

Let us consider first the derivative for regular generalized functions, i.e., for func- 
tionals A ¢ generated by a classical function f, in our case a smooth compactly sup- 
ported function of class c. Then it is natural to consider A f the derivative A’ f 


of Ay, generated by the function f’, the derivative of the original function f. 
Using integration by parts, we find that 


Al-(p) = Aplg) = is Foon) de = FPOD|™, — [ Fog") de = 


= — [ Fee’ as =Ay;(9’). 


Therefore, we find that in this case, 


A'-(g) =—Af(¢’). (E.8) 
This provides a reason to adopt the following definition of derivative: 
A’(g) := -A(g’). (E.9) 


It is indicated here how the functional A’ acts on a function y € Cc Therefore, 
the functional A’ is well defined. 

The action of a linear functional on a function @ is frequently written in the form 
(A, 9), instead of A(g), recalling the scalar product, to emphasize that this product 
is linear in both of its variables. 

With this notation if f is any generalized function, then according to Eq. (E.9), 
we have 


(f.g)=-(f.¢)). (E.10) 


E.8 Derivatives of the Heaviside Function and the Delta Function 


We shall compute the derivative of the Heaviside function, considering it a general- 
ized function acting according to the usual rule of regular generalized functions 


(H, ¢) = [ He ocoar. 
Following the definitions (E.9) and (E.10), we have that 
+oo 
(H’, 9) = —(H, @') = ie H(x)g'(x)dx = — / y/@)= 
0 


= —9(x)|F° = (0). 
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We have shown that (H’, y) = y(0). However, after the definition of the 5 function 
we have (6, g) = g(0). Hence, we have proved that in terms of generalized func- 
tions, we have the equality 


H'=6. 


Let us compute, for example, 5’ and 5”, i.e., we determine the action of the fol- 
lowing functionals: 


(5’, g) = —(8, 9’) := —9'(O); 
(5”, 9) a —(5', 9’ ae y” (0). 


It is clear now that in general, we have (6, y) = (—1)"y (0). 

We realize that generalized functions are infinitely differentiable. This is their re- 
markable property, which has many consequences. This property allows operations 
that with usual functions are possible only under very special conditions. 

To conclude, we would like to make the following remark of a general nature. Let 
X bea vector space and X™* its dual space, consisting of linear functions on X, and 
let X** be the dual space of X*. We shall write the value x*(x) of the function x* € 
X* at the vector x € X as a scalar product (x*, x), as we did before. By fixing x, 
we obtain a linear function with respect to x*. Thus every element of X can be 
interpreted as an element of X**, i.e., we have an embedding J : X > X**. In the 
finite-dimensional case, all the spaces X, X*, and X** are isomorphic, and J (X) = 
X**. In the general case, 1(X) ¢ X**, i.e., 1(X) is only a subset of the whole 
space X**. This is what is observed in the transition from functions (corresponding 
to regular functionals) to generalized functions, which turned out to be a larger 
space. 


Appendix F 
The Implicit Function Theorem 
(An Alternative Presentation) 


F.1 Formulation of the Problem 


The formulation of the problem and the heuristic arguments are discussed, of course, 
in a course lecture; but we shall omit this here, since the relevant material can be 
read in Sect. 8.5 of Chap. 8. 

We shall use a different approach for the proof of the implicit function theorem 
here, splendid and independent from that we presented in Sect. 8.5 of Chap. 8. This 
theorem assumes a somehow more advanced audience of readers, despite its con- 
ceptual simplicity, beauty, and generality. These readers are in general already more 
familiar with some general mathematical concepts, presented at the beginning of the 
second part of the textbook. In any case, all this information allows us to appreciate 
the real generality of the method, which we can show without loss of generality on 
simple visual examples in our familiar spaces. 


F.2 Some Reminders of Numerical Methods to Solve Equations 


By fixing one of the variables in the equation F(x, y) = 0, we obtain an equation in 
terms of the other variable. Therefore, it might be useful to remember how to solve 
equations f(x) =0. 

1) According to the properties of the given function f, one chooses the methods 
of solution. 

For instance, if the function f is real-valued, continuous, and taking values with 
different signs at the endpoints of the interval [a, b], then we know that in this in- 
terval there is at least one root of the equation f(x) = 0, and it is possible to find it 
through successive divisions of the interval. By dividing the interval in half, we get 
either the root or a half-interval where the function takes values with different signs 
at the endpoints. Continuing with this dividing process, we obtain a sequence (the 
endpoints of the intervals) converging to the root of this equation. 
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2) If f is a smooth convex function, then following Newton’s method, it is pos- 
sible to propose in this case a more efficient algorithm, in the sense of speed of 
convergence, to find a root. 

Newton’s method or the method of tangents works as follows. We build the tan- 
gent at the point xo, we find the intersection point with the x-axis, and by repeating 
this process, we obtain a sequence of points with a recurrence relation 


Xnt1 =n — (F'n) f On) (F.1) 


converging rapidly to the root. (Estimate the velocity of convergence. Obtain the 


relation xX,41) = 5 (Xn +a/x,), allowing you to find the positive root of the equation 


x? —a=0. Find V2 according to this formula with the desired accuracy and detect 


how many additional correct digits appear at each step.) 
3) Equation (F.1) can be written in the form 


Xnt+1 = g(Xn), (F.2) 


where g(x) = x — (Gea) mur ata? Thus, finding the roots of the equation for f(x) 
reduces to finding a fixed point of the mapping g, i.e., a point such that 


x = g(x). (F.3) 


This reduction, as we know, applies not only to Newton’s method. In fact, the equa- 
tion f(x) = 0 is equivalent to the equation Af (x) = 0 (if 27! exists), and that is 
equivalent to the equation x = x +Af(x). Setting g(x) =x +Af(x) (A can be a 
variable here), we arrive at Eq. (F.3). 

The process of solution of (F.3), i.e., finding a fixed point of the mapping g in 
accordance with the recursive formula (F.2), is called an iterative process or method 
of iterations, as we already know. This means that the value found in the previous 
step becomes the argument or input of the function f in the next step. This cyclic 
process is suitable for implementation in a computer. 

If the iteration process (F.2) is done in a region where |g’(x)| < q < 1, then the 
sequence 


x0, 
x1 = g(x0), 


x2 = g(x1) = 87(X0), 


Xn41 = 8(Xn) = 8" (Xo) 


is always fundamental (or a Cauchy sequence). Indeed, by applying the mean value 
theorem, we have 


IXn41 —Xnl <@|Xn —Xn-1| S++ S q"|x1 — xol. (F.4) 
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We apply the triangle inequality to it, and we obtain 


Xn —Xpn| S [Xn — Xn+1\ Spies Xntm—1 = Xn+m| < 


S (gh tet gh lei — x01 = 7 qi xol 5) 
It is useful to remark that if we take the limit m — oo in the last inequality, we 
obtain the estimate 
q” 
|x —Xn| < lx1 — xol, (F.6) 
Lg 
the deviation or evasion of x, from the fixed point x. 


Problem 1 Draw several variants of the curve y = g(x) intersecting the line y = x 
and a diagram simulating an iterative process Xn+1 = g(Xn) for finding a fixed point. 


F.2.1 The Principle of the Fixed Point 


The last arguments (relating formulas (F.3)-(F.6)) can obviously be applied in any 
metric space in which the Cauchy criterion is valid, i.e., where every fundamen- 
tal sequence is convergent. Such metric spaces are called complete metric spaces. 
For instance, R is a complete metric space with respect to the standard distance 
d(x’, x") = |x’ — x"| between points x’, x” € R. The interval J = {x € R| |x| < 1} 
is also a complete metric space with respect to this metric. If we remove a point 
from R or /, then clearly, the resulting metric space will not be complete. 


Problem 2 


a) Prove the completeness of the spaces R”, C, C”. 

b) Show that the closed ball B(a,r) = {x € X | d(a, x) <r} with radius r and 
center a € X in a complete metric space (X, d) is itself a complete metric space 
with respect to the induced metric d from the embedding BC X. 


We recall now the following definition. 


Definition 1 A mapping g : X — Y from a metric space (X,dyx) into another 
(Y, dy) is called a contraction if there exists a number gq € [0, 1[ such that for arbi- 
trary points x’, x” € X, 


dy(g(x'), g(2”)) Sadx(x’, x"). 


For example, if g: R — R is a differentiable function with the property that 
everywhere |g’(x)| <q < 1, then by the mean value theorem, we have |g(x’) — 
g(x”)| < q|x’ — x" |, and therefore g is a contraction mapping. The same can be 
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said about a differentiable mapping g : B — Y from a convex subset B of a normed 
space X (for instance, the ball B Cc R”) to a normed space Y if ||g’(x)|| < q at every 
point x € B. 

We are able to formulate now the following fixed-point principle. 

A contraction mapping g : X — X of a complete metric space into itself has a 
unique fixed point x. 

This point can be found through the iterative process Xn+1 = g(Xn), starting with 
any point xy € X. The speed of convergence and the error estimate for the approxi- 
mation are given by the inequality 


g” 
—4 


d(x}, xo). (F.6’) 


dO, Xn) SF 


The proof of this fact was given above by the deduction of formulas (F.4)-(F.6), 
where instead of |x’ — x’’| we have to write everywhere d(x’, x”). 

In order to appreciate the usefulness and scope for generalizing this principle, 
consider the following important example. 


Example I We look for the function y = y(x) satisfying the differential equation 
y’ = f(x, y) and the initial condition y(xo) = yo. 

Using the formula of Newton—Leibniz, we rewrite the problem in the form of the 
following integral equation for the unknown function y(x): 


y(x) =yo+ / f(t, y@) dt. (F7) 


xo 


On the right-hand side there is a mapping g, which acts on the function y(x), and 
we look for the fixed “point” of the mapping (action) g. 

For example, let f(x, y) = y, x9 = 0, and yo = 1. Then we deal with the solution 
of the equation y’ = y with the initial condition y(0) = 1, and Eq. (F.7) takes the 
form 


ya) =14 [ y(t) dt. (E8) 
0 


We then carry out the iterative process, starting with the function yo(x) = 0, and we 
successively obtain 


yi(x) =1, 


x 
yo(x) =1 +f yi(t)dt=1+x, 
0 


x 


a x 1 
a) =1+ f yadr=1+ f (+ ndsitrt 5x, 
0 0 
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1 1 , 
yn) =14+ —x+-°+—2'","7 
1! n! 


It is clear, that we obtain the function e* = | + ux fewer t ax pees, 


Problem 3 Show that if || f(x, v1) — f(x, y2)|| < M|ly1 — yz, then in a neigh- 
borhood of the point xo the iteration process is applicable in the case of the more 
general equation (F.7). 


In this way, Picard (Emile Picard, 1856-1941) was looking for the solution of the 
differential equation y’(x) = f(x, y(x)), with the initial condition y(x9) = yo as a 
fixed point of the mapping (F.7). 

Banach (Stefan Banach, 1882-1945) formulated the fixed-point principle in the 
abstract form above, and in this form it is often called Banach’s fixed-point principle 
or the Banach—Picard principle. However, its origins can be traced back to Newton, 
as we have seen. 


F.3 The Implicit Function Theorem 


F.3.1 Statement of the Theorem 


We return now to the main object of our consideration and prove the implicit func- 
tion theorem. 


Theorem Let X,Y,Z be normed spaces (for example, R™,R",R™” or even 
R, R, R), and suppose moreover that Y is a complete metric space with respect 
to the metric induced by the norm. Let F : W — Z be a mapping defined in a neigh- 
borhood W of the point (xo, yo) € X x Y, continuous at (x0, yo), together with the 
partial derivative ae (x, y), which is supposed to exist in W. If F (xo, yo) = 0 and 
there exist (Fy (xo, yo))7! and ||(FY (xo, yo))7! || < 00, then there exist a neighbor- 
hood U = U (xo) of the point xo in X, aneighborhood V = V(yo) of the point yo 
in Y, and a function f : U — V, continuous at xo, such that U x V Cc W and 


(F(x, y) =0 within U x V) & (y= f(x), x €U). (F.9) 


In short, under the conditions of the theorem, the set determined by the relation 
F(x, y) = 0 within the neighborhood U x V is the graph of the function y = f(x). 
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F3.2 Proof of the Existence of an Implicit Function 


Proof Without loss of generality and for brevity, we may assume that (xo, yo) = 
(0, 0), which can always be achieved by the change of variables x — xo and y — yo. 

For a fixed x we shall solve the equation F(x, y) = 0 with respect to y. We look 
for the solution as the fixed point of the mapping 


gx(9) =y —(F,0, 0) F(x, y). (F.10) 


This is a simplified version of Newton’s formula (F.1), where the coefficient 
is constant (see the paragraph following formula (F.3)). It is immediately clear that 
F(x, y)=0S gxr(y=y. 

The mapping (F.10) is a contraction if (x, y) is near (0,0) € X x Y. Indeed, 


dgx 
dy 


(y) = E—(F,,0)) Fy, y). (Fl) 


Here F is the identity (unitary) mapping, and since F. (x, y) is continuous at the 
point (0, 0), there exists a number A € R such that for ||x|| < A and ||y|| < A, 


dgx 
dy 


1 
=. F12 
= (F.12) 


Finally, note that for every ¢ € JO, A[ there is 6 € ]0, A[ such that if ||x|| < 6, 
then the function g, maps the interval (ball) ||y|| < ¢ into itself. 

Indeed, because of F(0, 0) = 0 and from Eq. (F.10), we have go(0) = 0. In view 
of the continuity of F at the point (0, 0), it follows from (F.10) that there is 6 € 0, A[ 
such that || g,(0)|| < 5é for ||x|| <6. 

Thus, for ||x|| < 6, the mapping g, : B(e) — Y displaces the center of the in- 
terval B(e) = {y € Y | ||y|| < e} no more than 5é. Therefore, by virtue of (F.12) it 
decreases B(e) by at least a factor of two. Hence, g,(B(e)) C B(e) for ||x|| <6. 

By assumption, Y is a complete space, and therefore B(e) C Y is also a complete 
metric space (with respect to the induced metric). 

Then by virtue of the fixed-point principle, there is a unique point y = f(x) € 
B(e) that is fixed for the mapping gy : B(e) > Be). 

Thus for every x with ||x|| <6, we have found a unique value y = f(x) 
(|| f (x) || < €) in the neighborhood B(e) such that F(x, f(x)) =0. 

(The cross section of the domain P = {(x, y) € X x Y | ||x|| <4, |||] < e} pass- 
ing through the point (x, 0) is the interval (ball) B(€) in which lies the corresponding 
fixed point y = f(x).) 

Thus, we have shown that 


(F(x, y) =0 for [|x|] <6 and |ly|| <¢) & (y= f(a) for |lx|| <8). (E13) 


Note that not only have we obtained the relation (F.9), but also, by virtue of the 
construction, for every ¢ € JO, A[ we can choose 6 > 0 such that (F.13) holds. Since 
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the function f has been found already and is fixed, we have that f(0) = 0 and f is 
continuous at x = 0. 


The theorem just proved can be regarded as the existence theorem of the implicit 
function y = f(x). 

We shall see now what properties of the function F are inherited by the func- 
tion f. 


F-3.3 Continuity of an Implicit Function 


If in addition to the conditions of the theorem, we know that the functions F and 
va are continuous not only at the point (xo, yo) but also in some neighborhood of 
this point, then the implicit function is also continuous in some neighborhood of this 
point. 


Proof Indeed, in this case, the conditions of the theorem will be fulfilled at all the 
points of the set F(x, y) = 0 near (x0, yo), and each of them could be considered 
a starting point (xo, yo). The function f has been found already, and therefore is 
fixed. 

Warning! Recall the exercise that if the mapping At» A7!, where A is mapped 
to its inverse (for example for a matrix A) is defined on A, then it is defined on a 
neighborhood of A. 


F3.4 Differentiability of an Implicit Function 


If in addition to the conditions of the theorem, we know that the function F is dif- 
ferentiable at the point (xo, yo), then the implicit function f is also differentiable at 
the point xo, and moreover, 


f' (x0) = —(F4(x0. yo) | FL (0, Yo). (F.14) 
Proof Given the differentiability of F at the point (xo, yo), we can write 
F(x, y) — F(xo, yo) = 
= F/(x0, yo)(x — x0) + F, (x0, yo)(y — yo) + o(|x — xol + Ly — yol)- 


Assuming for simplicity (x9, yo) = (0, 0) and considering that we are only moving 
along the curve y = f(x), we obtain 


0 = F,(0, 0)x + F,(0, 0)y + o(|x| + yl), 
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or 
y =—(F4(0, 0)" F/(0, O)x — (F/O, 0))'o(Ix1 + [yIl). (F.15) 


Since y= f(x) = f(x) — f (0), the formula (F.14) will be justified, if we can show 
that in the limit x — 0, the second term on the right-hand side of (F.15) is o(x). 
But 


| (F/O, 0))~"o(Ix1 + Lyl) | < | (E40) "|| - Jo(lal + Ly) | = o(lal + Ly). 
Further, 
|(F.@.0)F0,0]] < |(FO,0) "| -@,0)| =a <ow. 


Therefore, from (F.15) we obtain that |y| < a|x|+a(|x|+|y|), where y= f(x) — 0 
and a = a(x) — 0 for x — 0. Hence, 


at+a 
Wls5 


|x| < 2a|x| 
—a 


for x sufficiently close to 0. Given this, for x + 0 we obtain from (F.15) that 


f(x) =—(F,(0,0)) "FLO, 0)x + 0(x). 


In view of f (0) = 0, we have (F.14). 


F3.5 Continuous Differentiability of an Implicit Function 


If in addition to the conditions of the theorem, we know that the functions Fi. and 
ie are defined and continuous in some neighborhood of the point (xo, yo), then the 
implicit function f is also continuously differentiable in some neighborhood of the 
point xo. 

In short, if F € C), then f is also in ch, 


Proof In this case, the conditions of differentiability of f and (F.14) are fulfilled 
not only at (xo, yo) but at all points of the “curve” F(x, y) = 0 near (xo, yo) (see 
the above cautionary “Warning!”’). Then, according to formula (F.14), in a neigh- 
borhood of the point xo, 


f')=-(Fi(x, f@))) F(x, £@)), (F.14) 


from which it is clear that f’ is continuous. 


Warning! Recall that the mapping A> A™! is continuous. 
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F3.6 Higher Derivatives of an Implicit Function 


If in addition to the conditions of the theorem, we know that the function F is of 
class C™ in some neighborhood of the point (xo, yo), then the implicit function f 
is also of class C in a neighborhood of the point xo. 


Proof Suppose, for example, that F is of class C®. Since f is of class C“, the 
right-hand side of the equality (F.14’) can be differentiated according to the differ- 
entiation rule for composite functions (chain rule). We obtain then a formula for 
f(x), and from it the continuity of f” (x) follows. 

Moreover, as on the right-hand side of formula (F.14’), for f’(x) the first partial 
derivatives of F and the function f itself (but not f’) are involved; in the formula 
for f” (x), the second partial derivatives of F, f, and f’ are involved (but not f”). 

Thus, if F is of class C®), then we can differentiate f(x), and we arrive again 
at a formula for f’” (x) in which are involved the third partial derivatives of F, and 
also the derivatives of the functions f (f, f’, f”) of order less than three. 

By induction, we obtain what we claimed. 


Warning! Recall that the mapping A +> A7! is differentiable and even infinitely 
differentiable. 


Problem 4 


a) Find f” (x) (write down the formula for the computation of f”(x)(A1, h2) for 
given displacement vectors hj, h2). 

b) What does the formula (simplified) for f” (x) look like in the case that x, y, 
and z = F(x, y) are real or complex variables? 


Problem 5 (Method of undetermined coefficients). Suppose that we know the first 
(or all) coefficients of the Taylor series of the function F. Find the first (or all) 
coefficients of the Taylor series of the implicit function f. 


Problem 6 


a) Write in coordinate form the formulation of the implicit function theorem for 
the cases F : R” x R” — R”, when m =n = 1 and whenn > 1. 

b) Let F : R” > R” (m > n) be a linear mapping with maximal rank (= 7). 
What is the dimension of the subspace F~!(0) C R” and what is its codimension? 
Let F :R” — R” (m > n) be now an arbitrary smooth mapping, F(0) = 0 and 
rank F’(x) =n. Answer the same questions (dim F-!(0) = ?, codim F~!(0) = ?) 
with respect to the set F —1(0). 
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